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(57) Abstract 

Provided are cyclized peptides with a con- 
strained region(s) having an o-helical conforma- 
tion. Constrained helical peptides having amino 
acid sequences from HIV gp41 are provided, as is 
their use in preparing antibodies^ that prevent viral 
membrane fusion. Also provideo^e-methods for 
making such cyclized peptides. 



EKELE 



KQNSN 



SLEESR 




YLQEW 



HSNL 



NHQED 



T1QLA 



BNSDOC1D: <W Q 9 820036A1 t > 



t 



Codes used fa identify 



FOB THE PURPOSES 
States party to the PCT on the front 



AL 


Albania 


AM 


Armenia I 


AT 


Austria 


AU 


Australia 


AZ 


Azerbaijan 


BA 


Bosnia and Herzegovina 


BB 


Barbados \ 


BE 


Belgium % 


BF 


Burkina Faso i 


BG 


Bulgaria 


BJ 


Benin 


BR 


Brazil 


BY 


Belarus 


CA 


Canada 


CF 


Central African Republic 


CC 


Congo . 1 


CH 


Switzerland ] 


CI 


Cole d'Fvoire 


CM 


Cameroon 


CN 


China 


CU 


Cuba 


cz 


Czech Republic 


DE 


Germany 


DK 


Denmark 


EE 


Estonia 



ES 

Ff 

FR 

GA 

GB 

GE 

GH 

GN 

GR 

HU 

IE 

IL 

IS 

IT 

JP 

KE 

KG 

KP 

KR 

KZ 

LC 

LI 

LK 

LR 



Spain 
Finland 
France 
Gabon 

United Kingdom 

Georgia 

Ghana 

Guinea 

Greece 

Hungary 

Ireland 

Israel 

Iceland 

Italy 

Japan 

Kenya 

Kyrgyzstan 

Dem ocratic , People ' i 

Republic of Korea 

Republic of Korea 

Kazakstan 

Saint Lucia 

Liechtenstein 

Sri Lanka 

Liberia 



OF INFORMATION ONLY 

pages of pamphlets publishing international applications under the PCT. 



LS Lesotho 

LT Lithuania 

LU Luxembourg 

LV Latvia 

MC Monaco 

MD Republic of Moldova 

MG Madagascar 

MK The former Yugoslav 

Republic of Macedonia 

ML Mali 

MN Mongolia 

MR Mauritania 

MW Malawi 

MX Mexico 

NE Niger 

NL Netherlands 

NO Norway 

NZ New Zealand 

PL Poland 

PT Portugal 

RO Romania 

RU Russian Federation 

SD Sudan 

SE Sweden 

SG Singapore 



SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TM 


Turkmenistan 


TR 


Turkey 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


uz 


Uzbekistan 


VN 


Viet Nam 


YU 


Yugoslavia 


ZW 


Zimbabwe 



BNSDOCID: <WO 9820036A1J_> 



10 



PCT/U397/20069 

WO 98/20036 

CONSTRAINED HELICAL PEPTIDES AND METHODS OF MAKING SAME 
FIELD OF THE INVENTION 

Tteta v»„o« re h 1M .o.hoo<»fo™^ 

to fining p«U. . . I** ™» >°" - — 10 t™' ™ 

5 prepanuio- of H.V v,=c te b»ed o„ HIV gp4, p.>yp=p.id= -I—- This mvooaoo >• 
Lprovod ™..hod S ft, HIV infooion d*»b »d tanooogens which iodoco antibody u»t». m .he 
diagnostic methods. 

BACKGROUND OF THE INVENTION 

A variety ofm ethods for stabilizing o-helical peptides have been described previously. Addition of 
fiuoroethanolor hexafluoroisopropanolhas gently been used to stabilize a-helices in aqueous solution 
Dimerizadonof*^^^ 

peptides have been stabilized by incorporating groups at the termini to stabilize the intrinsic M 
Naturallyoccurringcappingmotifsas well as organic templates have been used to stabilize a-hehces by end- 
nucleate Several non-cova.ent side chain constraints have been investigated for a-hehx stabthzatton, 
includinghydro P hobicinteraction S , S altbrid g es,and metal ion chelation by both natural and unnatural am.no 

Finally, a-helices have been stabilized by covalent side chain tether, Chorev et a!., Bioc hemistry , 
^ 5968 .5974(1991),Osapay^J^^ 

7oc 114: 6966-6973(1990),B ra cken^., lJ**m**. lift 6431-6432 (1994), and Houston 
I^Lkta l: 274-282 (1995) described the stabilization of a-helices by side chain to s>de cham 
iactamization. Ravi «al, J^^n^lM: , 05-1 09 (1983) and Jackson i^^. 
,13- 9391-9392 (1991) described the constraint of peptides by disulfide bonds between rescues. The 
naturaUv occurring peptide apamin has been used as a scaffold for the presentation of a-henca, pepude 
sequences constrained in helical conformation by disulfide bonds to scaffold cysteine rescues. 
, 5 Acquired immunodeficiency syndrome (AIDS) is caused by a retrovirus identified as the human 

" immU nodeficiencyvirus(HIV). 

,0^^ — -i-^rf-^-^'^ Recenteffortshaveusedsubunn 

vaccineswnereanHIVpr,^^ 

for safety reason, Subunit vaccines generally include gp 1 20, the portion of the HIV envelope protem winch 

30 is on the surface of the virus. . 

The HIV envelope protein has been extensively described, and the ammo acidand nucle.c ac.d 
^uencesencodmgHIVenvelopefromanumberofHIV strains are Known (Myers, G. etal., 1992. Human 
RetrovirusesandAIDS. A compilationand analysis of nucleic acid and amino acid sequences. Los Alarm* 
National Laboratory, Los Alamos, New Mexico). The H!V envelope protein is a ^^ ^Z 
35 kd ( gP 160) which is anchored in the membrane bilayer at its carboxyl terminal regton. The N-ternuna 

membrane protein gp4l. As there is no covalent attachment between gP 120 and gP 41, free gP 120 
sometimes released from the surface of virions and infected cells. 
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op 120 has been the object of intensive investigation as a vaccine candidate for subunit vaccines, as 

tneviralp^teinwfai^ 

MN strain are underway. 

„. . . i9ft or another HIV protein for protection agamst additional 

However, effective vaccines based on gp 1 20 or anomer m v H « r 

5 strainsofHIVarestillbeingsoughttopreventthespreadofthisdisease. 

SUMMARY OF THE INVENTION 
Themventionprovidesame^^ 
of.(l)synthes iZ ingapeptide,wh^^ 

thesequenceof eight amino acid residues has a first terminal residue and a second terminal residue, wherein 
10 mefirstterminalresidue^ 

second terminal residue has a side chain containing an amide bond-fonning substituent; (2) providing a 
dinmctional linker havingafirst functional p^c^-f^-^^^^^*-; 
amidebond-formingsubstituentof the first terminal residue and having a second functional group capable of 

15 formmganamidelmkagewim^^ 

and (3) cyclizing the peptide by reacting the side chain amide bond-forming substituent of me first te^al 

residuewuhthefirstf^^ 

chainauiidebond-formingsubstituentofthe.condterminal residue with the second Actional group of the 
Afunctional linker to form an amide linkage, yielding a constrained heiical peptide. 

The invention also provides a method for constructing a constrained helical peptide comprising the 
steps of: (1) synthesizing a peptide, wherein the peptide comprises a sequence of eight amino acid residues, 
whereinthe seouenceof eight amino acid residues has a first terminal residue and a second termina. residue 
wherein the first termina! residue and the second terminal residue flank an internal sequence of six amino acid 
residues,wherein^ 

the second terminal residue has a side chain containing an amide bond-forming substituent, and wherein the 
side.chauv amide bond-forming substituent of the first terminal residue is protected with a first protectmg 
group and the sidechamam^^ 
secondprotec^^ 

removable; (2) removing the first protecting group such that the side cham amide bond-fonnmg —t 
of the first terminal residue is deprotected and the side chain amide bond-forming substituent of the second 
terminalresid^ 

of formingan amide linkage with the side chain amide bond-forming substiment of me fu,ttermma^ 

andhavingasecond functional group capable of forming an amide J^^***~**-+ 

fonnmgsubstitu^ 
a^idelinkagebe^^ 

forming substituentofthefirsttermina, residue; (5) removing the secondprotectinggrouptodeprotectthes.de 
^ amide bond-forming substituent of the second terminal residue; and (6) cvcHzmg the peptide y 
intramolecularlyreacting the side chain amide bond-forming substituent of the second terminal residue with 
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the second functional group of the Afunctional linker to form an amide linkage and yield a constrained helical 
peptide. 

The inventionfurfher provides a method for constructing a constrained helical peptide, cotnpnsmg 
the steps of : (a) synthesizinga peptide, wherein the peptide comprises a sequence of eight amino acid residues, 

5 wherein the sequenceof eight aminoacid residues has a first terminal residue and a second term. nai residue, 
whereinthe first terminal residue and the second terminal residue flank an internal sequence of six ammo acd 
residues, wherein the first terminal residue has a side chain containingan amide bond-forming substituent and 
the second terminal residue has side chain containing an amide bond-forming substituent, wherein the first 
terminal residue is coupled to a Afunctional linker having a first functional group and a second funcnonal 

10 group, wherein the first functional group is in an amide linkage with the side chain amide bond-formmg 
substituentof the first terminal residue, and wherein the second functional group of the difunctional linker . 
capable of forming an amide linkage with the side chain amide bond-forming substituent of the second 
terminal residue; and (b)cyclizingthe peptideby intramolecularlyreactingthe side chain amide bond-formmg 

substimentoftte^ 

1 5 an amide linkage and yield a constrained helical peptide. 

The invention additionally provides a method for constructing a constrained helical peptide 
comprisingthe steps of: (1) synthesizinga peptide, wherein the peptide comprises a sequence of eight ammo 
acid residues, and wherein the sequence of eight amino acid residues has a first terminal residue and a second 
terminairesidue,whereinthe first terminal residue and the second terminal residue are independently selected 

20 from Asp and Glu; (2) providing a diamine linker having a first amino group capable of form.ng an am.de 
Unkagewith the carboxy side ch^ 

an amide linkage with the carboxy side chain of the second terminal residue; and (3) cyciizing the pepfde by 
reacting the first amino group of the diamine linker with the carboxy side chain of the first terminal resume 
to form an amide linkage and reacting the second amino group of the diamine linker with the carboxy stde 
25 chain of the second terminal residue to form an amide linkage, yielding a constrained hei.cal pept.de. 

The inventionalso encompasses a method for constructing a constrained helical peptide compmmg 
the stepsof: (1) synthesizinga peptide, wherein the peptide comprises a sequence of eightamino acid residues, 
wherein the sequence of eight amino acid residues has a first terminal residue and a second termmai residue, 
wherein the first terminal residue and the secondterminal residue flank an interna! sequenceof s.x ammo acd 
30 residues, wherein the first terminal residue and the second terminal residue are independently selected from 
Asp and Glu, and wherein the carboxy side chain of the first terminal residue is protected w.th a first 
protecting group and the carboxy side chain of the second terminal residue is protected with a second 
protecting group such that the first protecting group and the second protecting group are d t fferent.aiiy 
removable; (2)removingthefirstprotectinggrou P suchthatthe carboxy side chain of the firstterminalrestdue 
35 is deprotected and the carboxy side chain of the second terminal residue is not deprotected; (3) reactmg the 
peptide wimadiaminelinkerhavingafirstaminogroupandasecond amino group to form an am.de hnkage 
between the deprotected carboxy side chain of the first terminal residue and the first amino group of the 
diamine linker, (4) removing the second protecting group to deprotect the carboxy side chain of the second 
term.nalre S idue;and(5) cyclizingthe peptide by intramolecularlyreactingthe deprotected carboxy side cham 
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rt f the diamine linker to form an amide linkage 
of the second terminal residue with the second amino group of the diamine 

and vield a constrained helical peptide. 

L invoobo, —p^es , nradrod te co™ . «- P** 

ooorpnsinstho stops of, (0 synthesizing Pi* — * - **» 3 SSqUe ": T T d 

5 ^Zll <*L tha sc-ce of eight »». acid reside has . ft. — >-• — - ■ «- 

^LTft™, Asp «td * aod vrherch, the cboxy «. chain of*. « — »*. . coupiad „ 

^— — — ^•^^"Tr 

"j v „,„;th the first amino group of the diamine linker, and (2) cychzmg 
i n first terminal residue is in an amide linkage with the tost amino group 

^debyin^^ 

amino group of the diamine linker to form an amide linkage and yie.d a 

The invention also encompasses a compound selected from the group consistmg of: 

the compound represented by Formula (1): 
15 0=C-CNH)-(CH 2 ) n -CNH)-C=0 

- I • -■. 'I : 

(CH 2 ) m (CH 2 ) p 

| - I 

X-(NHMCH)-C Z— (NH>(CH)-C-Y-S , 

» 'I o 

„ herei „ S is absent or is ?L— *-* X is hydtogen or is any amino acid or amino ncid 
is absent, or is hydroxy, if S is absont, or is any amino acid or amino acd ~***Z 

rang. defi«d by CMmepB » CM-1>» W**. prosoded thai n is grcaterthan it 
the compound represented by Formula (6): 

30 ' : " ' li " 
H-N-C-(CH 2 V(NH)-C=0 

! I 
(CH 2 ) q <CH 2 ) S 

i 1 
35 X-(NH>(CH>C-Z-(>JHHCH)-C-Y-S , 

O o 

„h„ai» S is absent or is . X is hydn*. - is any amino acid or 

seq „™c.,Y is abson, or is hydmxy, if S is absan., or is any amino aoid or ammo ncd re^m* Z 

than 0; 
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the compound represented by Formula (11): 

O 
II 

0 =C-(NHMCH 2 ) U -C-N-H 

5 I I ■ ; ■ ■ 

. (CH 2 \ ( CH 2)v 

I I 
X-(NHHCH>C-Z-(NHHCH)-C-Y-S . 

II « 

o o 

wherein S is absent or is a macromolecule, X is hydrogen or is any amino acid or ammo acd 
sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, Z 
is any amino acid sequence consisting of six amino acids; t is selected from the integers 0 to 6 
inclusive, and v is selectenfrom the integers 1 to 7 inclusive, provided that t + v is less than or equal 
to 7; and u is any integer in the range defined by (7-(t + v» to (9-(t + v)) inclusive, provided that u ,s 

greater than 0; and 
the compound represented by Formula (16): 
O O 



10 



15 



H-N-C-(CH 2 ) X -C-N-H 

! i ■ . 

(CH 2 ) W (CH 2 )y 

I I 

25 X-(NH)-C-C~ Z--N-(CH)-C-Y-S , 

I II I .11 

HO H O 
(16) 

wherein S is absent or is a macromolecule, X is hydrogen or is any amino acid or ammo acrd 
30 sequenced is absent, or is hydroxylifS is absent, or is any amino acid or amino acid sequence,* 

is any amino acid sequence consisting of six amino acids; v, and y are independently selected from 
the integers 1 to 7 inclusive, provided that w + y is less than or equal to 8, and x is any Integer m the 
range defined by (7-(w + y)) to (9-(w + y» inclusive, provided that x is greater than or equal to 0. 
Inapreferredembodimentisprovidedacompoundcontaining a constrained helical pepade that m 
35 turncontainsapeptideof asequenceof eight amino acid residues, in which the sequence of eight ammo acd 
" residueshasafnsttennmalresidueandasecondtermmalresiduethatfia^ 

acidsand that haveaside chain that are linked to each other forming a locking moiety to form a constramed 
helical peptide. The interna, sequence of six amino acids has the form gabede, defgab, or cdefga and ts 
selected from the group of sequences consisting of asequence of six contiguous amino acids m HIV-iLAI 
40 strain^! amino acid sequence 633 to 678, in Us homolog sequence from anotherHIV strain, in a consensus 
sequenceof its homolog sequences from any one HIV Cade, or an amino acid substituted variant thereof, » 
which amino acid 633 or its corresponding amino acid in the homolog, consensus or variant sequence ,s 
assignedpositiona of a repeating abedefg assignment for the 633-678 sequence (as shown in Figure 18). In 
these compounds the locking moiety or tether is between adjacent f positions when the internal sequence « 
45 of ^f ml ^^^V^^^^'^^^^^^^ 
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prions when ft. h^l seque.ce is of .he fonn cdefge. Mo* pteferebiy the .0* » ^ ^ 

678 region. in o pntfe-red etubodhnen, ft. in.ame, sequence of sue andno ac.ds h» the form gabedc. n. 
compoundspreferablyhaveHIVano-fusogenicoraun-iufection.ctivily. 

pM a bte s^c.h^»yo«e.r.ftyc— 

Leal peptide seeies , ,0 Xn . shown in Figures 23A end 23B in ^binanon «*» any one or any 

Figures 23A and 23B. PepddesHIV 2 4 and H.V31 are r-mcrfariy purred compounds of - 
10 toanofterenftodhnen. .he compound, of ft. invention are used » h^ans, prefemb.y cached «. 

nrd^c^trfP^-^^-^^^^^^^^TTr 
popddesnt.y inchtdchutarauo. ftohedto, P— **— — *" — ™ 

15 wha^in -» .re expo*d to biood ptoducrs. The conned pep.tdes of the _ 

c„ serve ft. roie of a prophy.oc.ic vaccina, whe.0, .he host mises antibodies agafts, the pepad* 
invention, which then serve to neutral H,V virus* hy, for exuntple, inhibidng h*. HIV tnfccuon. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig , is a diagran, depitdftg the syndics* of peptide lb nnd 1c. Raagen. .'qt— *>% 
20 pipcridine/DMA; cage., » represeu* HjNCHjCH £H *■» <R=H or BOO, BOP, DIPEA, CH «. a 
^ . represents Pd(PPh 3 )4, 20% piperidine/DMA, (R-BOO TFA/CH 2 C, 2 /.oiso 1 a,.,2^.».=d i th tol 
45:45:5:5 v/y; reagent^ represents BOP, DIPEA, CH2*-''2' reagcnie 
0 "C; and reagent/represents CH 3 NH 2 , BOP, CH 2 C1 2 . 

Fig 2 is a diagram depicting the synthesis of N-Fmcc-S-Acn-D-thiolysine (compound 7). Reagent 

25 ^^n^BW^^^"^ 1 ^ 

• represents 0.25 M HCU THF/H 2 0; reagent d represents Hg(OAc) 2 , TFA; H.S; reagent . represents 
acetamidome^^ 

^XoaandSb are graphs depicting^ 

11— ,~a a t ->sn K nH 5 0 500 MHZ and a peptide concentration 
30 spectru.nofpept.delc The spectrum was collected at 280 K, pH 5.0, 5 

of 1 5 mM with a 4.5 kHz spin-lock mixing pulse of 200 ms durat.on. Lmes connect the ROEs by wh.ch 
sequential assignments were made. Rectanguiar, oval and diamond shaped boxes denote intra res.due, 
sequential and (/, 1+3) correlations, respectively. 

Fi g .4isagraphde P ic^ Forthed^andd^ 
35 rows, observation of the sequential ROE is indicated by a bar connecting two residues^the thickness °^^ ie 
bar indicatingthe relative intensity of the ROE. The downward pointing arrows indicate to less than 

o ™* <u a H N r 1+3 and H K -hP /, 1+3) are indicated by the lines m the 
6.0 Hz. Observed medium range ROEs (H -H 1,1+Jtman n , 
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lower part of the figure; dotted lines and stars indicate ROEs that could not be unambiguous* observed 
because of chemical shift degeneracy. The coil motif above the primary sequence indicates the reg.on deduced 
to have helical structure from the NMR data; the dashed coil indicates sections of peptide where only some 

of the NMR data indicate helical character. 
5 Fig. 5 is a molecular model depicting an ensemble of 20 rMD structures calculated using NMR data 

for peptide U. The structures were overiayed using the N. Ca and C atoms of residue Thrl to GlnlO. 
Backboneandside-chainheavy atoms are connected by solid and dotted lines, respectively. The side-chatns 
of Arg8 and Arg9 are truncated at C*, and all side-chain atoms of Glnl 1 and G!nl2 are omitted for clarity. 
Fig. 6 is a graph depicting the CD spectra of peptide U at 280, 310, 330, 350, and 370 K. 
10 Fig. 7 is a graph depicting the CD spectra of peptides 1 and 3 (Apamin-based sequences) at 280 K. 

Fig. 8 is a graph depictingthe CD spectra of peptides 2 and 4 (C-peptide-based sequences) at 280 K. 
Fig 9 is a graph depictingthe thermal denaturation profile of peptide le as determinedby CD spectra 
obtained before, during and after heating for 1 day at 87 «C. Circles indicate the initial spectrum obtained 
from a sample before heating; squares indicate the spectrum obtained from a sample at 87»C dunng 
1 5 incubation; triangles indicate the spectrum obtained from a sample after recooling to 7 °C at 0.2 'Omm. 

Fig. 1 0 is a graph depicting a section of the TOCSY spectrum of peptide 1c. The data were collected 
at 280 K pH 5 0, 500 MHZ and a peptide concentration of 1 .5 mM with a mixing time of 90 ms. The solid 
linesconnect cross-peaks between backbone amide and side chain protons; assignments are indicated at the 
top of each line. Dashed lines connect cross-peaks between the side chain amide protons of Gln3 and GlnlO 

20 and the methylene linker resonances. 

Fig 11 is a diagramdepictingthe synthesisof a locked helix species of the peptide Asn-Met-Glu-Gln- 
Gln-Arg-Arg-Phe-Tyr-Giu-Ala-Leu-His where the carboxy side chains of the Glu residues are covalently 

linked with a 1 ,5-pentanediamine linker. 

Figure 12 depicts sequences and schematic representations of the locked-helbc peptide embodiments 
25 of the invention. The cylinders recent «-helices, with the stippled faces corresponding to the 4, 3 
hydrophobic repeat. Covalent restraints linking sidechains at / and / +7 are represented as dark hues. 

Figure 1 3 is a circular dichroism spectra of peptides HIV24 (open squares), HIV30 (open crcles), 
HIV3 1 (closedcircles), and HIV35 (closed squares). Spectra were acquired at V C in 10 mM Tris-HCl, P H 

7.5(21). . 
30 Figures 14A and 14B are graphs depicting the effect of inhibitory peptides in primary mfect.vrty 

assays using PBMCs with virus JRCSF, an NSI strain (Figure -14A), and BZ167, an SI strain (Figure 14B) 
(22). HIV24 (closed triangles); HIV30 (open circles); HIV31 (closed circles); HIV35 (closed squares); 
DP 178, (open squares). 

Figure 15 is a schematicof a proposed mechanism for assembly of the fusogenic state of g P 41 (top) 
35 and inhibition by constrained peptides (bottom). 

Figures 16A to 16G present amino acid sequences of gp41 from known HIV virus strains and their 
consensussequencesbased on statistical amino acid frequency. Amino acids are represented by the standard 
singleiettercode. The strains within each HIVclade are presented. A "-" in a sequence represents the ammo 

~7- 
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corresponding position in viral sequences within that clade. An upper case amino acid in a consensus 
seq uence indicates that only that amino acid is found at that corresponding position in viral sequences 
within that clade. Strain designations with no sequence information indicate that the compete gP 41 

sequence has not been determined. 

Figure 17 is a summary of consensus sequences from known strains. The peptide sequence of 

DP178 b delineated. The nomenclature is the same as in Figures 16A to 16G. 

Figure 18 is a schematic presenting an alignment of sequences from clades A, B, C, D, and E 
consensus sequences, peptides DP 178, HIV35 and the Neurath peptide, in which the repeating heptad 
abcdefg assignment as taught herein is provided, and positions of some constraining locks are md.cated. 
For example, amino acids in the sequence ESQNQQ of DP178 are assigned positions g, a, b, c, d, and e, 
respectiveiy, and thus has the form gabcde, for purposes o the present invention. This sequence the 
internal sequence of six amino acids present in peptide HIV24, which is a single-lock form of the HIV35 
sequence. Locations of internal sequences of the invention are those found between locking res.dues, 
whose positions are indicated by the T symbols and each of which, in this example, correspond to 
assigned position f. Positions for placing either one, two or three locks in the representative presented 
sequences are shown. The figure delineates five gabcde form helical sections suitable for locking when 
lo cks occur at adjacent f position, Also shown are locations of gabcde form helical sections when one, 
two or three i to H-7 iocks are present in a 633-678 sequence or variant thereof.. The two-lock vanants are 
.abeled (II), (111), HIV31. (VI) and (VII), and the one-lock vanants (VIII), (IX), HIV24, (XI) and (XII). 

Three-lock variant is labeled (I). 

Figure 19 is a helica! wheei representation of the representative gp41 fusion pepttde sequence 
from the H1V-1 LAI strain, showing the "abcdefg" heptad reading frame and the heptad repeat pattern as 
assigned herein (see Figure 18) for the purposes of the present invention. 

Figure 20 is a schematic depicting the use of the compounds of the invention as haptens for 
immunization and shows the gp41 core trimer, Its DP178 binding groove and the 633-678 region that 
• binds this grove. Hapten a presents the 4,3 repeat surface from HIV 24 in a constrained helix. Th, face 
would presumably never be exposed to the immune system since it likely does not form until exposed to 
the trimer -groove" binding site. . An antibody raised to it, however, would essentially be an anu-tdtotyptc 
antibody to the trimer grooves. When the resting state gP 4, is exposed to this antibody it would mduce 
the C-terminus of gP 4. to form a helix in a non-productive state, „ bound to the antibody, t us 
sequestering the protein off the fusion pathway. Antibodies to Hapten a are expected to bmd to the 
indicated regiori, ihducmg this region to form a helix m 
assembly of the fusogenic helical bundle. 

Figure 21 is a schematic depicting a proposed mechanism for antibody intervention m HIV v,ral 

infectivity. - in . n 

Figure 22 is presents a consensus sequence of the HIV gp41 sequences from Ftgure 17 wtth all 
aliowed amino acid substitutions in each position listed. For example, at the fifth amino acid posinon 
(starting from the N-terminal amino acid (left end)), the amino acids E (glutamic acid), D (asparUc acid) 
and K (lysine) are allowed without disrupting H-bonding, thus without disrupting helicity or sigmficantiy 
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interfering with the peptide's interaction with the core coiled-coil trimer of gp41 . "X" indicates positions 
that can be substituted with any non-helix breaking amino acid. The repeating heptad abcdefg ass lg nment 
for each amino acid position in the 633 to 678 sequence, for purposes of the present invention, is shown. 
The »*" indicate b, c, and f positions that, when not used for locking the helix, can be replaced with a non 
helix-breaking amino acid without significantly disturbing H-bonding, helicity and trimer groove binding. 

Figure 23 presents a shorthand notation of specific peptides in peptide series I through XII (as in 
Figure 18), indicating locking positions, amino acid substitution variant peptides, and truncation variant 
peptides of each. The "X" indicates a position that can be substituted with any non helix-breaking amino 
acid, 
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Figure 23 presents a shorthand notation of specific peptides in peptide series I through XII (as in 
Figure 18), indicating locking positions, amino acid substitution variant peptides, and truncation variant 
peptides of each. The "X" indicates a position that can be substituted with any non helix-breakingamino acid, 
but preferably with an amino acid present in that position from any one of the known HIV sequences shown 
5 in Figure 1 6. "B" indicates a position used for the bridging (or tethering or locking) residues. Preferred f 
positions are presented for locking; however in less preferred embodiments the c and some b positions can 
be used for locking. As in Figure 18, locations of internal sequences relevant to the inventions those found 
between lockingresidueswhosepositionsareindicatedby the T symbols and correspond to assigned position 
f T in this example. Positions for placing either one, two or three locks in the representative presented 

10 sequences^ shown. Th^ 

occur at adjacent f positions. The «.» indicates positions that can be optionally absent from the final 

cons^inedhelicalpeptidecompoundwithoutsubstantiaHyeffectmgmeheHcalproperti^ 

properties of the final constramedhelical peptide. For example, a peptide based on peptide I, having the three 

locks placed as indicated, can optionally lack any one or all of the five N-terminal amino acids WXXWE, 

15 which are marked by a"." Further, another series of truncated variants is indicated in the figure-C-terminal 
truncated variants-sincethe five C-terminal residues (LWMWF) are marked with a can be absent. When 
the 4ck is placed more centrally in the 633-678 sequence, as shown in peptide series II, peptides in this series 
can lack additional amino acids at the C-terminal end as indicated by the marked positions. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

20 A. DEFINITIONS 

j Aminoacidsand aminoacid residues described herein may be referred to according to the accepted 
onelrthreelenercodeprovidedinthetablebelow. Unless otherwise specified, these amino acids or residues 

are of the naturally occurring L stereoisomer form. 

.] Three-Letter 
25 ] Common Name One-Letter Symbol Symbol 



-A 

-p 
i 



30 



35 



40 



45 



Alanine 


- A 


Arginine 


R 


Asparagine 


N 


Aspartic acid 


D 


Cysteine 


C 


Giutamine 


Q 


Glutamic acid 


E 


Glycine 


G 


Histidine 


H 


Isoleucine 


I 


Leucine 


L 


Lysine 


K 


Methionine 


M 


Phenylalanine 


F 


Proline 


P 


Serine . 


S 


Threonine 


T 


Tryptophan 


W 


Tyrosine 


Y 



Ala 

Arg 

Asn 

Asp 

Cys 

Gin 

Glu 

Gly 

His 

He 

Leu 

Lys 

Met 

Phe 

Pro 

Ser 

Thr 

Trp 

Tyr 
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In general, unless otherwise specified, the abbreviations used for the designation of amino acids and the 
protective groups used therefor are based on the recommendations of the IUPAC-IUB Commission of 
Biochemical Nomenclature (Bio^h^nisiry, 11: 1726-1732 (1972)). 

As used herein, the term -(CH 2 )„- is used to denote a straight chain .alky I substituent of n carbons in 
5 length, wherein -(CH 2 ) 0 - is defined as a chemical bond, i.e. indicating that no aikyl substituent is present, - 
(CH,),- is defined as a methyl substituent, -(CH 2 ) 2 - is defined as an ethyl substituent, etc. 

As used herein, the term "C,-C 6 aikyl " means a saturated aliphatic hydrocarbon substituent having 
the number of carbon atoms specified. C r C 6 alkyl encompasses cyclic and straight chain hydrocarbons, 
unbranchedand branched hydrocarbons, substituted and unsubstituted hydrocarbons, and primary, secondary 
10 and tertiary hydrocarbon substituents. Representative examples of these alkyl substituents include methyl, 
fluorenylmethyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, sec-butyl, tert-butyl, n-pentyl, 2-methylbutyl, 
2,2-dimethylpropyl,n-hexyl,2-methylpentyl,2^-di m ethylbutyl, cyciohexyl, and the like. The terms "tower 
alkyl", "simple alky!" and "C r C 6 alkyr are synonymous and used interchangeably. 

As used herein, the terms "peptide", "polypeptide", and "protein" are used synonymously and refer 
15 to any proteinaceous compound comprising an amino acid sequence of two or more amino acid residues. 

As used herein, an "amide bond-forming substituent contained in an amino acid side chain", a "side 
chain amide bond-forming subsituent", and their grammatical variants, are defined to include (1) any carboxy 
substituent contained in the side chain ("R" group) of an aminoacid wherein the carboxy substituent is capable 
of forming an amide linkage with an amino group contained in another molecule, i.e. the carboxy substituted 
20 reacts with an amino group contained in another molecule to form an amide linkage; and (2) any amino 
substituentcontainedin the side chain <"R" group) of an amino acid wherein the amino substituent is capable 
of forming an amide linkage with a carboxy group contained in another molecule, i.e. the amino substituted 
reacts with a carboxy group contained in another molecule to form an amide linkage. 

As used herein, "differentiallyremovable" protecting or protective groups are defined as any pair of 
25 protective groups capable of protecting a first amide bond-forming substituent and a second amide bond- 
formingsubstituent, wherein it is possible to deprotectthe first amide bond-forming substituent protected with 
one member of the pair under conditions which do not deprotect the second amide bond-forming substituent 
protected with the other member of the pair. Differentially removable protecting groups are also referred to 
herein as "orthogonal" protecting groups, and the differentially removable protection conferred by such 
30 protective groups is referred to herein as "orthogonal" protection. 

The term "epitope" as used herein, designates the structural component of a molecule that is 
responsiblefor specific mteractionswtmcorre S pondingantibody(immunoglobulm) molecules elicited by m^ 
same or related antigen. More generally, the term refers to a peptide having the same or similar 
immuno re acuveproperties,such ^ 
35 to generated antibody. Therefore, an epitope that is formed by a specific peptide sequence generally refers 
to any peptide which is reactive with antibodies directed against the specific sequence. 

The term "antigen" as used herein, means a molecule which is used to induce production of 
antibodies. The term is alternatively used to denote a molecule which is reactive with a specific antibody. 
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The term "immunogen" as used herein, describes m entity that indues antibody production in a host 
animal. In some instances the antigen and the tann^^m^^™^?"^^^ 
entities are different. 

The term "subunit vaccine" is used herein, as in the art, to refer to a viral vaccine that does not coniain 
5 virus butramer contains one ormor* viral proteins orfragmen* of proteins. As used herein, the term 
.■muhivalent-meansthatthe vaccine 

sequence from at least two HIV isolates having different ammo acid sequences. 

The term "breakthrough^^ usedherein, as in the art, to referto a v,rus 

isolated from a vaccinee. 

10 B. GENERAL METHODS 

In generate invention provides a method for removing elements of «-helical secondary structure 

one aspect, the method is useful for artificially reconstructing and characterizing** binding deternunants that 
existwithinan a-helica. bidding domain of a protein of interest. The design of molecules which are capable 
15 of binding competitive* at a protein interface requires the ability to mimic the higher level structure of the 
natural ligand. If the ligand's Structure at the site of protein interface can be mimicked with a short pept.de, 
then the peptide can be used to determine Whether it is feasible to design small molecules that competinvely 
bind at the protein interface. A short' peptide's ability to competewith the natural ligand for binding at the 
proteininterfa^^ 

20 that the short peptide could be used as a model for designing small molecules that compete with the natural 
ligand for binding at the protein interface. 

-In another aspect, the method, of the invention are used to stabilize the conformational structure of 
a protein or peptide. The pr^nt methods can be employed in place one (or more) a-hehcal 

determinantts) of interest m a ptotein or peptide such that the protein (qr peptide) retains an «-hehcal 
25 conformation in enVh^hmehts Or conditions that would ^stabilize or deteriprate the a-helica. secondary 
structure of an unconstrained protein Or peptide species. 

The rhethods of the invention are also useful for the replication of protein function without an mtact 
protein or intact functional domain. For example, the replication of a protein's binding actmty by a 
constmined helical peptide of the invention would allow the use of affinity pupation procedures for the 
30 protein's ligand MT K&*m*«>~W^*^*^**'* ^ * 
helical peptide ****** ^Vm^*^)^^^!"*"* « C °* 

preventmgmeuseofmeptOtemm^ 

possessmgmecohformatiOn,! m^m^^^^m^im^^^^'^ 
a conformationaiepitopefrom the test of the protein and raise antibodies against the single epitope of mterest 

35 without hmhns&tmsm other WW* •*» adafcgto^^P*** 1 - 

Particularly ptefefted are the use of the compounds of the invention having constrained hehcal 
jp^^^l^WM**^^^™ «« amino acid sequence 633-678 

and homologs thereof, for use as haptens, vaccines, and in diagnostics. 
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In another aspect, the methods and peptides of the invention can be used to create combinatorial 
constrained helical peptide libraries that are useful in chemical selection systems. 
I. Locked Helix Peptides and Uses Therefor 
The invention provides locked helix peptides of formula ( 1 ): 
5 0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I I 
(CH 2 ) m (CH 2 ) p 

I I 
X-(NH)-(CH>C Z~ (NHMCH)-C-Y-S 

io li " 

o o 

(1) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 
15 Z is any amino acid sequence consisting of six amino acids; m and p are independently selected from the 
integersO to 6 inclusive, provided that m + p is less than or equal to 6; andn is any integer in the range defined 
by (7-(m+p» to (9-(m+p)) inclusive, provided that n is greater than 1 . 

In another embodiment, the invention provides locked helix peptides of formula (2): 
0=C-(NHMCH 2 ) n -(NH)-C=0 

20 I I 

| CH 2 

I • I 

CH 2 CH 2 

I I 
25 X-(NH>(CH>C— Z--(NH)-(CH>C-Y-S 

II II 

o o 

(2) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
30 absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 

Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In yet another embodiment, the invention provides locked helix peptides of formula (3): 

0=C-(NHMCH2) n -(NHVC=0 

I I 
35 CH 2 . I 

I I 
CH 2 CH 2 

I I 
X-(NH)-(CH)-C Z--(NH>(CH)-C-Y-S 

40 II I' 

o o 

(3) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 
45 Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (4): 



42- 



BNSDOC1D; <W O 9 Sa0036A1 J_> 



10 



PCT/US97/20069 

WO 98/20036 



0=C-(NH)-(CH 2 ) n -(NH)-C=0 

CH 2 CH 2 
1 I 
CH 2 CH 2 

I ! 
X-(NH)-(CH)-C— Z— (NH)-(CH)-C-Y-S 

o 0 

(4) 

w^Sisabsentorisama^^ 

absent, or is hydroxy! if S is absent, or is any amino acid or amino acid sequence; ^ 
Z U any amino acid sequence consistmg of six ammo acids; and n is any mte e er from 3 to 5 mc us,ve. 
in still another embodiment, the invention provides locked helix peptides of formula (5). 
0^-(NHMCH 2 V(NH)-O=0 

X-(NH)-(CH)-C— Z--(NH)<CH)-C-Y-S 
o 0 

wherein Sisabse„torisam^^ 

absent, or is hydroxy, if S is absent, or is any amino acid or amino acid sequence; 
Z is any amino acid sequent consisting ^ " ™*" 

in still pother embodiment 

' O. ,.. ■ ' 



15 



20 



25 



•■• - H-N-C-(CH 2 ) r -(NH)-C-0 y 

(CH 2 )q 
| I 

X-(NH>(CH)-C-Z-(NH)-(CH)-C-Y-S 

o o : " 

35 

(6) 

™ n w u i e - X is hvdrogen or is any amino acid or amino acid sequence; Y is 
wherein S is absent or is a macromolecule, a is nyorogen u j 

a bsent,ori S hydrox y lifSi S absent,orisany a mmoacidoraminoacidsequence; e 
Zfcany amino acid sequence consistm g of six ammo acids; q is selected * 
40 ^sLectedfrommeinte^ 

in still another embodiment, the invention provides locked helix peptides of formula (7). 
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20 



0 



H-N-C-(CH 2 ) n -(NH)-C=0 

[ I 
CH, CH 2 

I " I 
X-(NH)-(CH>C-Z-(NH>(CH)-C-Y-S 



O O 
(7) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 
15 Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (8): 
O 



H-N-C-(CH 2 ) n -(NH)-CO 



CH 2 i 

I I 
CH 2 CH, 

I I 
25 X-(NH)-(CH)-C-Z--(NH)-(CH)-C-Y-S 

II !l 
o o 

(8) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
30 absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 

Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (9): 
O 

SI 

35 H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
CH, CH 2 

I " I 
CH, CH, 

40 I I 

X-(NH)-(CH)-C-Z--(NH)-(CH)-C-Y-S 

. II 11 
O O 
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wherein S is absent or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxy! if S is absent, or is any amino acid or amino acid sequence; 
Z is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (10): 

5 O 

II 

H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
CH 2 CH 2 

I I 
X-(NH)-(CH)-C-Z-(NH)-(CH)-C-Y-S 



10 



(10) 

1 5 wherein S is absent or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 
Z is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive, 
in still another embodiment, the invention provides locked helix peptides of formula (1 1): 

O 

20 II 

0=C-(NH>(CH 2 ) u -C-N-H 

I I 
(CH 2 ) t (CH 2 ) V 

I I 
25 X-(NH)-(CH)-C-Z-(NH)-(CH)-C-Y-S 



O o 

(ID 

wherein S is absentor is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Y is 
30 absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 

Zis any amino acid sequence consisting of six amino acids;tis selected from the integers 0 to 6 inclusive, and 
v is selected from the integers I to? inclusive, provided that t+v is less than or equal to 7; and u is any integer 
in the range defined by (7-(t+v)> to (9-(t+v)) inclusive, provided that u is greater than zero. 

In still another embodiment, the invention provides locked helix peptides of formula (12); 
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10 



40 



O 
II 

0-C-(NH)-(CH 2 ) n -C-N-H 

I * I- 

| CH, 

I I 
CH 2 CH 2 

i I 

X-(NH)-(CH)-C»Z-(NH)-(CH)-C-Y-S 



O O 
(12) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxy! if S is absent, or is any amino acid or amino acid sequence; 
15 Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (13): 

O 
II 

0=C-(NH>(CH 2 ) n -C-N-H 

20 II 

CH 2 I 

i i 
CH 2 CH 2 

I I 
25 X-(NH)-(CH)-C-Z-(NHHCH)-C-Y-S 

II II 

o o 

(13) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
30 - absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 

Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (14): 

O 
II 

35 OC-(NH)-(CH 2 ) n -C-N-H 

I I 
CH 2 CH 2 



X-(NH)-(CH)-C-Z~(NH)-(CH)-C-Y-S 

I! II 
o o 

(14) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence: 
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Z is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (15): 

O 

il ' 

5 0=C-(NH)-(CH 2 VC-N-H 

I I 
CH 2 CH, 

I I 
CH 2 CH 2 

I I 
X-(NH)-(CH)-C"Z»(NH)-(CH)-C-Y-S 



10 



O O 
(15) 

15 wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydroxyl if S is absent, or is any amino acid or ammo acid sequence; 
Z is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (16): 

O o 



20 



H-N-C-(CH 2 ) X -C-N-H 



(CH 2 ) W (CH 2 ) y 

I . I 
25 X-(NH)-C-C--Z— -N-(CH)-C-Y-S 

111 , I II 
HO H O 

(16) ^ 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
30 absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 

Z is any amino acid sequence consisting of six amino acids; w and y are independently selected from the 
integers 1 to 7 inclusive, provided that w+y is less than or equal to 8; and x is any integer in the range defined 
by (7-(w+y)) to (9-(w+y)) inclusive, provided that x is greater than or equal to 0. 

In still another embodiment, the invention provides locked helix peptides of formula (17): 
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0 o 

il il 

H-N-C-(CH^) n -C-N-H 

I I " 

I I 
CH 2 CH 2 

I I 
X-(NH)-C-C-— Z— N-(CH)-C-Y-S 

I II I H 

HO H O 

(17) 

wherein S is absent or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
absent, or is hydrpxyl if S is absent, or is any amino acid or amino acid sequence; 
Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive, 
in still another embodiment, the invention provides locked helix peptides of formula (1 8): 

O o 

II II 

H-N-C-(CH 2 ) n -C-N-H . ... 

1 I 
CH 2 1 

1 I 
CH-> CH 2 

! " i 

25 X-(NH>C-C---Z----N-(CH)-C-Y-S 

I II I II 

HO H O 

(18) v 
wherein S is absentor is a macromolecule; X is hydrogen or is any amino acid or amino and sequence; Y is 

30 absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 
- - Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (19): 
O O 

II II 
35 H-N-C-(CH 2 ) n -C-N-H 

I - - , J . 
GH 2 CH 2 

I I 
X-(NH)-C-C— Z--N-{CH)-C-Y-S 

40 I II I II 

HO H O 

wherein S is absentor is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Y .s 
absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 
45 Z is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 
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In still another embodiment, the invention provides locked helix peptides of formula (20): 

O o 

II II 
H-N-C-(CH 2 ) n -C-N-H - 

5 I "I 

CH 2 CH 2 

I I 
CH 2 CH 2 

i ! 

to X-(NH)-C-C— Z--N-(CH)-C-Y-S 

I II I II 

HO H O 

wherein S is absentor is a macrimolecule; X is hydrogen or is any amino acid or amino acid sequence; Y is 
1 5 absentjor is hydroxyl if S is absent, or is any amino acid or amino acid sequence; 

2 is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 

In a further embodiment, the invention provides locked helix peptides of formula (1), formula (2), 
formula (3), formula(4), formula(5), formuIa<6), formula<7), formula(8), formula(9), formula(lO), formula 

(nXfonnuIaO^formu^ 
•>0 (19) and formuia(20) whereiriX, Y, and Z collectively contain up to or about 35 amino acids (i.e. locked helix 
peptides of formulas (1), (2), (3), (4), (5), (6), (7), (8), (9), (10), (1 1), (12), (13), (14), (15), (16), (17), (18), 

(19) and (20) each of which contains a total of no more than or about 35 amino acid residues). 

Alsoprovidedhereinarelockedhelixpeptidesofformula(l),formula(2),for m ula(3), formula(4), 

formula (5), formula (6), formula (7), formula (8), formula (9), formula (10), formula (1 1), formula (12), 
25 formuM(13),formuIa(14), formula(!5), formula(16), formula(17), formula(18), formula (19) and formula 

(20) wherein X and/or Y contains) up to or about 30 amino acid residues. 

| Further provided he^in are locked helix peptides of formula (1), formula (2), formula (3), formula 
(4), formula^), formula(6), formula{7), formula<8), formula (9), formula (10), formula (1 1), formula (12), 
forrmjla<13), formula (14), formula (15), formula(16), formula(17), formuia(18), formula (19) and formula 
30 (20) wherein X and/or Ycoiitain(s) up to or about 25 amino acid residues. 

1 Additionally provided herein are locked helix peptides of formula (1), formula (2), formula (3), 
formula (4), formula (5), formula (6), formula (7), formula (8), formula (9), formula (10), formula (11), 
form^tt), formula 

and formula (20) wherein Xand/or Y contain(s) up to or about 20 amino acid residues. 
35 : Alsoencompassedheremarelockedhelixpe^^ 

(4),formula(5), formula (6), formula (7), formula(8), formula(9), formula (10), formula (11), formula (12), 

formula(13), formula(14), formula(lS), formula(16), formula(17), formula(3 8), formula (19) and formula 

(20) wherein X and/or Y contain(s) up to or about 15 amino acid residues. 

Further encompassed herein are locked helix peptides of formula (1), formula (2), formula (3), 
40 formula (4), formula (5), formula (6), formula (7), formula (8), formula (9), formula (10), formula (11), 
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fonnu!a(12Uo™ula<13),formula(^ 

and formula (20) wherein X and/or Y contain(s) up to or about 10 amino acid residues. 

Additionally encompassed herein are locked helix peptides of formula ( 1 ), formula (2), formula (3), 
formula (4), formula (5), formula (6), formula (7), formula (8), formula (9), formula (10), formula (11), 
5 fonnuia(12),formula(13),formula(14^^ 

and formula (20) wherein X and/or Y contain(s) up to or about 5 amino acid residues. 

Also within the scope of the invention are locked helix peptides of formula (1), formula (2), formula 
(3), formula(4), formula(5), formula (6), formula (7), formula (8), formula (9), formula (1 0), formula (1 1), 
formula(12),formula(13),formula(14^ 
1 0 and formula (20) wherein X and/or Y contains) up to or about 3 amino acid residues. 
The invention also provides locked helix peptides of formula (la); 
0=C-(NH)-(CH 2 ) n -(NH)-C=0 



15 



30 



(CH 2 ) m (CH 2 ) p 

I I 
X-(NH)-(CH)-C Z-— (NH)-(CH)-C-Y 



O o 

(la) . 
20 wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 

amino acid sequence^ isany amino acid sequence consistingof six amino acids; m and p are independently 
selected from the integers 0 to 6 inclusive, provided that m+p is less than or equal to 6; and n is any integer 
in the range defined by (7-(m+p)) to (9-(m+p)) inclusive, provided that n is greater than 1 . 

In another embodiment, the invention provides locked helix peptides of formula (2a): 
25 0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I 1 
I CHv 



CH 2 CH 2 

I I 
X-(NH)-(CH)-C-~-Z»-(NH)-(CH)-C-Y 



o o 

(2a) • 
35 wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is anynmino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In yet another embodiment, the invention provides locked helix peptides of formula (3a): 
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O=C-(NH)-(CH^) n -(NH)-C=0 

I • I 

CU-y ! 
I " - i 

5 CHo CH 2 

I " i 
X-<NH)-(CH)-C Z--(NH)-(CH)-C-Y 

II i! 
o o 

10 (3a) , . „ 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxy! or is any ammo add or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (4a): 
15 0-C-(NH)-(CH 2 ) n -(NH)-C=0 

I I 
CH 2 CH 2 



20 



CH-, CH 2 

I " I 
X-(NH)-(CH>C Z--(NH)-(CH)-C-Y 



O o 
(4a) 

25 wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. . 

In still another embodiment, the invention provides locked helix peptides of formula (5a): 
0=C-(NH)-(CH 2 ) n -(NH)-C=0 

30 



35 



CH 2 CH 2 

I I 
X-(NH)-(CH)-C Z— (NH)-(CH>C-Y 

II II 

o o 

(5a) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
5 to 7 inclusive. 

40 In still another embodiment, the invention provides locked helix peptides of formula (6a): 
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H-N-C-(CH 2 ) r -(NH)-C=0 

I I- 
5 (CH 2 ) q (CH 2 ) S 

X-(NH)-(CH)-C-Z— (NH)-(CH)-C-Y 
II il 

o o 

10 ( 6a ) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; q is selected from the 
integers 1 to 7 inclusive, and s is selected from the integers 0 to 6 inclusive, provided that q+s is less than or 
equal to 7; and r is any integer in the range defined by (7-(q+s)) to (9-(q+s)) inclusive, provided that r is 
1 5 greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (7a): 
O 



20 



H-N-C-(CH 2 ) n -(NH)-C=0 

I I . 

| CH 2 

i I 
CH, CH 2 

I " I 
25 X-(NH)-(CH)-C-Z--(NH)-(CH)-C-Y 

II. II 

o o 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 
30 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (8a): 

O 
II 

35 H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
CH^ j 

I " I 
CH 2 CH 2 

40 I I 

X-(NH)-(CH)-C-Z~(NH)-(CH)-C-Y 
II II 

o o 

(8a) 
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wherein X is hydroeen or is any amino acid or amino acid sequence; Y is hydroxyl or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (9a): 

5 O 

II 

H-N-C-(CHo) n -(NH)-C=0 

I I 
CH, CH 2 

I " I 
CH 2 CH 2 

I I „ 

X-(NH)-(CH)-C~Z--(NH)-(CH)-C-Y 



10 



15 



o o 

wherein X is hydrogen or is any atntao acid or amino acid sequence; Y is hydroxyl or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

20 In still another embodiment, the invention provides locked helix peptides of formula (10a): 

o 

II 

H-N-C-(CH 2 ) n -(NH)-C=0 

I . I 
25 CH 2 CH 2 

■ I ■ I ■ 

X-(NH)-(CH)-C-Z"(NH)-(CH)-C-Y 



O o 
, n (10a) 

wherein X is hydrogen oris any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acd or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (1 la): 



35 



40 



O 

■• II .... 

0=C-(NH)-(CH 2 ) u -C-N-H 

I I 
(CH 2 ) t (CH 2 ) V 

1 I 
X-(NHMCH)-C"Z~(^HMCH)-C-Y 

II II 

o o 

(11a) 
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10 



wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxy! or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; t is selected from the 
integersO to 6 inclusive. and v is selected from the integers 1 to 7 inclusive, provided that t+v is less than or 
equal to 7; and u is any integer in the range defined by (7-(t+v)) to (9-(t+v)) inclusive, provided that u is 
greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (12a): 

O 

II 

0=C-(NH)-(CH 2 ) n -C-N-H 

I I 
I CH 2 



CH 2 CH 2 

I I 
15 X-(NH)-(CH)-C-Z--(NH)-(CH)-C-Y 

II II 

o o 

(12a) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxy! or is any amino acid or 
20 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (13a): 

O 
II 

25 0=C-(NHMCH 2 ) n -C-N-H 

I I 
CH 2 I 



CH, CH 2 

30 | I 

X-(NH)-(CH)-C-Z--(NH)-(CH)-C-Y 

II II 
O O 

03a) . 
3 5 wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (14a): 
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0=C-(NH)-(CH 2 ) n -C-N-H 

! 1 
5 CH 2 CH 2 

I I 
X-(NH)-(CH)-C--Z"(NH)-(CH)-C-Y 

II II 

o o 

10 (14a) . 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (15a): 

O 

il 

0=C-(NH)-(CHo) n -C-N-H 



15 



CH-, CH 2 

20 | I ' 

CH 2 CH 2 

t I 
X-(NH)-(CH)-C-Z-(NH)-(CH)-C-Y 

II Si 

25 O O 

(15a) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

30 ia still another embodiment, the invention provides locked helix peptides of formula ( 1 6a): 

- • o o — ■ ••• 

. II • il - 

H-N-C-(CB,) X -C-N-H 

I I 
35 (CH 2 ) W (CH 2 ) y 

I i 
X-(NH)-C-C--Z— N-(CH)-C-Y 

I II I II 

HO H O 
40 (16a) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; w and y are independently 
selected from the integers 1 to 7 inclusive, provided that w+y is less than or equal to 8; and x is any integer 
in the range defined by (7-(w+y)) to <9-(w+y)) inclusive, provided that x is greater than or equal to 0. 
45 i n still another embodiment, the invention provides locked helix peptides of formula ( 1 7a): 
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0 o 
I! II 

H-N-C-(CH 2 ) n -C-N-H 

I I - 

5 I CH 2 

I I 

1 1 

X-(NH)-C-C— -Z-— N-(CH)-C-Y 

10 I II I II 

HO H O 

(17a) . 
wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
15 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (1 8a): 

o o 

II I! 

H-N-C-(CH 2 ) n -C-N-H 

20 I i 

CH 2 I 

I I 
CH2 CH-> 

1 i 

25 X-(NH)-C-C~-Z~~ N-(CH)-C-Y 

I II I II 

HO H O 

(18a) . 
wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 

30 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (1 9a): 

o o 

II IE . 

35 H-N-C-(CH 2 ) n -C-N-H 

CH 2 CHo 

1 I 
X-fNH)-C-C«~ Z— -N-(CH)-C-Y 

40 I II I li 

HO H O 

(19a) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxyl or is any ammo acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
45 5 to 7 inclusive. 
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In still another embodiment, the invention provides locked helix peptides of formula (20a): 
O O 

II II 
H-N-C-(CH 2 ) n -C-N-H - 

5 I I 

CHo CH 2 

I " I 
CH 2 CH 2 

I I 
10 X-(NH)-C-C — Z--N-(CH)-C-Y 

I II I II 

HO H O 

(20a) . 

wherein X is hydrogen or is any amino acid or amino acid sequence; Y is hydroxy! or is any amino acid or 
1 5 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

In a further embodiment, the invention provides locked helix peptidesof formula (la), formula (2a), 
formula (3a), formula (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula 
(10a), formula (lla), formula (12a), formula (13a), formula (14a), formula (1 5a), formula (16a), formula 
20 ( 1 7a), formula (1 8a), formula ( 1 9a) and formula (20a) wherein X, Y, and Z collectively contain up to or about 
12 amino acids (i.e. locked helix peptides of formulas (la), (2a), (3a), (4a), (5a), (6a), (7a), (8a), (9a), (10a), 
(lla),(12a), (13a), (14a), (15a), (16a), (17a), (18a), (19a) and (20a) each ofwhich contains a total of no more 
than about 12 amino acid residues). 

Alsoprovidedherein are lockedhelix peptides of formula (la), formula (2a), formula (3a), formula 
25 (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula (1 0a), formula (1 la), 
formula (12a), formula (13a), formula (14a), formula (15a), formula (16a), formula (17a), formula (18a), 
formula ( 1 9a) and formula (20a) wherein X and/or Y contains) up to or about 30 amino acid residues. 

Furfoer providedherem are bcked h^ 
(4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula .{ 10a), formula (lla), 
30 formula (12a), formula (I3a), formula (14a), formula (15a), formula (16a), formula (I7a), formula (18a), 
formula (19a) and formula (20a) wherein X and/or Y contain(s) up to or about 25 amino acid residues. 

AdditionaUyprovided herein are locked helix peptides of formula (la), formula (2a), formula (3a), 
formula (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula (10a), formula 
(lla), formula (I2a), formula (13a), formula (14a), formula (15a), formula (16a), formula (1 7a), formula 
35 (18a), formuia(19a) and formula(20a) wherein X and/or Y contains) up to or about 20 amino acid residues. 

Also encompassed herein are locked helix peptides of formula (la), formula (2a), formula (3a), 
formula (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula (10a), formula 
(1 la), formula (12a), formula (13a), formula (14a), formula (15a), formula (16a), formula (17a), formula 
(18a), formula (19a) and formula (20a) wherein X and/orY contain(s) up to or about 1 5 amino acid residues. 
40 Further encompassed herein are locked helix peptides of formula (l a); formula (2a), formula (3a), 

formula (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula (10a), formula 
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(1 la), formula (12a), formula (13a), formula (14a), formula (15a), formula (16a), formula (17a), formula 
( 1 8a), formula ( 1 9a) and formula (20a) wherein X and/or Y contain(s) up to or about 1 0 amino acid residues. 
Additionally encompassed herein are locked helix peptides of formula (la), formula (2a), formula 
(3a), formula (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a) t formula (10a), 
5 formula (1 la), formula (12a), formula (13a), formula (14a), formula (15a), formula (16a), formula { 1 7a), 
formula (18a), formula (19a) and formula (20a) wherein X and/or Y contain(s) up to or about 5 amino acid 
residues. 

Also within the scope of the invention are locked helix peptides of formula (la), formula (2a), 
formula (3a), formula (4a), formula (5a), formula (6a), formula (7a), formula (8a), formula (9a), formula 
10 (10a), formula (1 la), formula (12a), formula (13a), formula (14a), formula (15a), formula (16a), formula 
( 1 7a), formula ( 1 8a), formula ( 1 9a) and formula (20a) wherein X and/or Y contain(s) up to or about 3 amino 
acid residues, - * 

The invention also provides locked helix peptides of formula (lb): 
0=C<NH)-(CH 9 ) n -(NH)-C-0 

15 | I 

(CH 2 ) m (CH 2 ) p 

I - I- - - • ■ 

(NH 2 MCH>C Z~--(NHMCH)-C-Y 

il si 

20 O O 

(lb) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; m and p are independently selected from the integers 0 to 6 inclusive, provided that m+p 
is less than or equal to 6; and n is any integer in the range defined by (7-(m+p)) to (9-(m+p)) inclusive, 
25 provided that n is greater than 1 . 

In another embodiment, the invention provides locked helix peptides of formula (2b): 
0=C-(NH)-(CH 2 ) n -(NH)-C=O 
. | „ ! .. ■ • . • \ .. • .-. 

30 | I 

CH 2 CH 2 

I I 
(NH 2 )-(CH)-C-"-Z-- (NH)-(CH)-C-Y 

- • : II - ^ II . - ' - 

35 O O 

(2b) 

wherein Y is hydroxy] or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 

In yet another embodiment, the invention provides locked helix peptides of formula (3b): 
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O=C-<NHHCH,) n -(NH)-C=0 

! " ■ I 



CH 2 CH 



1 



(NH 2 )-(GH)-C Z— (NH)-(CH>C-Y 

II II 

o o 

10 1 (3b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any ammo acid sequence consisting 

of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (4b): 
? • 0=C-(NHMCH 2 ) n -(NH)-C=0 

15 I I 

CH 2 CH^ 

I I 
CH, CH 2 

I I 
20 (NH 2 MCH)-C Z — (NH)-(CH)-C-Y 

II II 

o o 

(4b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
25 of six amino acids; and n is any integer from 3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (5b): 
0-C-(NH)-(CH 2 ) n -(NH)-C=0 



30 



CH 2 CH 2 

I ; I 
(NH 2 MCH)-C Z-~(NH)-(CH)-C-Y 



I o o 

(5b) 

35 wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any ammo acid sequence consisting 
of six amino acids; and n is any integer from 5 to 7 inclusive. 

: " In still another embodiment, the invention provides locked helix peptides of formula (6b): 

o 

II 

40 H-N-C-(CH 2 ) r -(NH)-C=0 

I I 
i . (CH 2 ) q (GH 2 ) S 

I i 
(NH,MCH>C~Z~ (NH)-(CH>C-Y 

45 II II 

o o 
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(6b) 

wherein Y is hydroxy 1 or is any amino acid or amino acid sequence; Z is any ammo acid sequence coasting 
of six amino acids; q is selected from the integers 1 to 7 inclusive, and s is selected from the integers 0 to 6 
inclusive, provided that q+s is less than or equal to 7; and r is any integer in the range defined by (7-(q+s)) to 
5 (9-(q+s)) inclusive, provided that r is greater than 0. 

in still another embodiment, the invention provides locked helix peptides of formula (7b): 

O 

. II 

H-N-C-(CH 2 ) n -(NH)-C=0 
10 I I 

I I 
CH 2 CH 2 

I I 
15 (NH 2 )-(CH)-C--Z-(NH)-(CH)-C-Y 

II II 

o o 

(7b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
20 of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (8b); 

O 



25 



H-N-C-(CH 2 ) n -(NH>C=0 

I ' ! 

CH 2 | 



CH 2 CH, 

I I 
30 (NH 2 )-(CH)-C~Z~(NH>(CH)-C-Y 

II II 

o o 

(8b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
35 of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (9b): 
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o 

II 

H-N-C-(CH,) n -(NH)-C=0 

I " I" 

CH 2 CH 2 

I I 
CH 2 CH 2 

I I 
(NH 9 )-(CH)-C-Z-(NH)-(CH)-C-Y 



O O 
(9b) 

wherein Y is hydroxylor is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 3 to 5 inclusive. 
15 In still another embodiment, the invention provides locked helix peptides of formula ( 1 Ob): 

O 
II 

H-N-C-(CH 2 ) n -(NH)-C=0 

! I 

20 CH 2 CH 2 

I I 
(NH 2 )-(CH>C-Z-(NH)-(CH)-C-Y 

II II 

o o 

25 (10b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 

of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (1 lb): 

O 

30 II 

OC-(NH)-(CH 2 ) u -C-N-H 

I I 
(CH 2 ) t (CH 2 ) V 

! I 

35 (NH 2 )-(CH)-C~Z--(NHMCH)-C-Y 

II H 

O • O .: ... .... . 

• (Hb) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
40 of six amino acids; t is selected from the integers 0 to 6 inclusive, and v is selected from the integers 1 to 7 
inclusive, provided that t+v is less than or equal to 7; and u is any integer in the range defined by <7-<t+v)) to 
(9-(t+v)) inclusive, provided that u is greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula < 12b): 
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0 =C-(NH)-(CH 2 ) n -C-N-H 

I I - 

I CH 2 

I ! 

CHi CH 2 

I " I 
(NH 2 )-(CH)-C--Z--(NH)-(CH)-C-Y 

II '' II 

o o 

(12b) 

wherein Y is hydroxy) or is any amino acid or amino acid sequence; 2 is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
15 In still another embodiment, the invention provides locked helix peptides of formula (13b) 

O 

II 

0=C-(NH)-(CH 2 ) n -C-N-H 

I I 
20 CH 2 I 

I 



25 



CH 2 CH 2 

I 1 
(NH 2 )-(CH)-C"Z--(NH)-(CH)-C-Y 



o o 

(13b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
30 In still another embodiment, the invention provides locked helix peptides of formula (14b): 

o 
I! 

0=C-(NH)-(CH 2 ) n -C-N-H 

I " I 

35 CH 2 CH 2 

I . I . 

(NH 2 )-(CH)-C--Z--(NH)-(CH)-C-Y 

II H 

o o 

40 (14b) . 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 

of six amino acids; and n is any integer from 5 to 7 inclusive. 

In stilt another embodiment, the invention provides locked helix peptides of formula (15b): 
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o 

11 

0=C-(NH)-(CH 2 ) n -C-N-H 

I I- 
5 CH, CH 2 

I ' I 
CH-, CH 2 

I " I 
(NH 2 )-(CH)-C--Z-(NH>(CH)-C-Y 

10 II II 

O O 

05b) , 
wherein Y is hydroxy! or is any amino acid or amino acid sequence; 2 is any amino acid sequence consisting 

of six amino acids; and n is any integer from 3 to 5 inclusive. 
1 5 in still another embodiment, the invention provides locked helix peptides of formula ( 1 6b): 

O O 

II II 
H-N-C-(CH 2 ) X -C-N-H 

I " ! 

20 (CH 2 ) W (CH 2 ) y 

I I 
(NH 2 )-C-C— Z— N-(CH)-C-Y 

I II I II 

HO H O 

25 (16b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any ammo acid sequence consistmg 

of six amino acids; w and y are independently selected from the integers 1 to 7 inclusive, provided that w+y 
is less than or equal to 8; and x is any integer in the range defined by (7-(w+y» to (9-(w+y)) inclusive, 
provided that x is greater than or equal to 0. 
30 In still another embodiment, the invention provides locked helix peptides of formula ( 1 7b): 

O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I I 
35 I CH 2 

I I ... 

" CH 2 CH 2 

I I 
(NH 2 )-C-C— Z-~ N-(CH)-C-Y 

40 1 II I II 

HO H O 

(17b) 

wherein Y is hydroxy! or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
45 In still another embodiment, the invention provides locked helix peptides of formula (18b): 
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o o 

II II 

H-N-C-(CH 2 ) n -C-N-H 

I I - " 

5 CH, | 

I I 

I I 
(NH 2 )-C-C — Z-~ -N-(CH)-C-Y 

10 I II I i! 

HO H O 
(18b) 

wherein Y is hydroxy 1 or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
1 5 In still another embodiment, the invention provides locked helix peptides of formula ( 1 9b): 

O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I I 
20 CH, CH 2 

I " I 
(NH 2 )-C-C— Z — N-(CH)-C-Y 

I II I II 

HO H O 

25 (19b) 

wherein V is hydroxylor is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 

of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (20b): 
O O 



30 



H-N-C-(CH 2 ) n -C-N-H 



CH7 

I I 
35 CH 2 CH 2 

S I 

- (NH 2 )-C-C— -Z—N-(CH)-C-Y 

I II I II 

HO H 0 

40 (20b) 

wherein Y is hydroxyl or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 

of six amino acids; and n is any integer from 3 to 5 inclusive. 

Also provided herein are locked helix peptides of formula ( 1 b), formula (2b), formula (3b), formula 

(4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), formula (1 0b), formula (lib), 
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formula (12b), formula (13b), formula (14b), formula (15b), formula (16b), formula ( 1 7b), formula (18b), 
formula (19b) and formula (20b) wherein Y contains up to or about 30 amino acid residues. 

Further provided herein are locked helix peptides of formula (lb), formula (2b), formula (3b), 
formula (4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), formula (10b), formula 
5 (lib), formula (12b), formula (13b), formula (14b), formula (15b), formula (16b), formula (17b), formula 
(18b), formula (19b) and formula (20b) wherein Y contains up to or about 25 amino acid residues. 

Additionallyprovidedherein are locked helix peptides of formula (lb), formula (2b), formula (3b), 
formula (4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), formula (10b), formula 
(lib), formula (12b), formula (13b), formula (14b), formula (15b), formula (16b), formula (17b), formula 
10 (1 8b), formula (19b) and formula (20b) wherein Y contains up to or about 20 amino acid residues. 

Also encompassed herein are locked helix peptides of formula (lb), formula (2b), formula (3b), 
formula (4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), formula (10b), formula 
(lib), formula (12b), formula (13b), formula (14b), formula (15b), formula (16b), formula (17b), formula 
(1 8b), formula (19b) and formula (20b) wherein Y contains up to or about 1 5 amino acid residues. 
15 Further encompassed herein are locked helix peptides of formula (lb), formula (2b), formula (3b), 

formula (4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), formula (10b), formula 
(lib), formula (12b), formula (13b), formula (14b), formula (15b), formula (16b), formula (17b), formula 
(I8b), formula (19b) and formula (20b) wherein Y contains up to or about 10 amino acid residues. 

Additionally encompassed herein are locked helix peptides of formula ( 1 b), formula (2b), formula 
20 (3b), formula (4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), formula (10b), 
formula (1 lb), formula (12b), formula (13b), formula (14b), formula (15b), formula (16b), formula (17b), 
formula (18b), formula (19b) and formula (20b) wherein Y contains up to or about 5 amino acid residues. 

Also within the scope of the invention are locked helix peptides of formula (lb), formula (2b), 
formula (3b), formula (4b), formula (5b), formula (6b), formula (7b), formula (8b), formula (9b), fonnula 
25 (10b), formula (lib), formula (12b), fonnula (13b), formula (14b), fonnula (15b), formula (16b), fonnula 
(17b), formula (18b), formula (19b) and formula (20b) wherein Y contains up to or about 3 amino acid 
residues. 

The invention also provides locked helix peptides of formula (lc): 

0=C-(NH)-(CH 2 ) n -(NH)-C-0 

30 I 1 

<CH 2 ) m (CH 2 ) p 

X-(NH)-(CH)-C Z-»(NHMCH)-C-OH 

II II 
35 O O 

(lc) 

whereinX is hydrogen or is any amino acid or amino acid sequence^ is any amino acid sequence consisting 
of six amino acids; m and p are independently selected from the integers 0 to 6 inclusive, provided that m+p 
is less than or equal to 6; and n is any integer in the range defined by (7-(m+p)) to (9-(m+P)) delusive, 
40 provided that n is greater than 1 . 
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In another embodiment, the invention provides locked helix peptides of formula (2c): 
0=C-(NH)-(CH,) n -(NH)-C=0 

i I 

| - CH 0 

5 I i 

CH 2 CH 2 

i I 
X-(NH)-(CH>C Z — (NHMCH)-C-OH 

II II 
io O O 

(2c) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 

In yet another embodiment, the invention provides locked helix peptides of formula (3c): 
15 0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I I 
CH, ! 

I I 

CH-, CH 2 

20 | " I 

X-(NH)-(CH)-C Z (NH)-(CH)-C-OH 

II II 

o o 

(3c) 

25 wherein X is hydrogen or is any am ino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (4c): 
0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I I 
30 CH, CH 2 

.1 I 
CH 2 CH 2 

I i 
X-(NH)-(CH)-C Z— (NH)-(CH)-C-OH 

35 i! SI 

o o 

(4c) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any am uio acid sequence consisting 
of six amino acids; and n is any integer from 3 to 5 inclusive. 
40 In still another embodiment, the invention provides locked helix peptides of formula (5c): 
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10 



15 



30 



0=C-(NHMCH,) n -(NH)-C=0 

I " I 

CB> CH 2 

I " - i 

X-(NH)-(CH)-C 2— -(NHKCH)-C-OH 



(5c) 



wherein X is hydrogen or is any amino acid or amino acid sequenced is any amino acid sequence consisting 
of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (6c): 

O 

II 

H-N-C-(CH 2 ) r -(NH)-O=0 



(CH 2 ) q (CH 2 ) S 

I I 
X-(NHMCH>C-Z— (NH)-(CH)-C-OH 

II " li " 
20 O O 

(6c) 

whereinX is hydrogen or is any amino acid or amino acid sequenced is any amino acid sequence consisting 
of six amino acids; q is selected from the integers 1 to 7 inclusive, and s is selected from the integers 0 to 6 
inclusive, provided that q+s is less than or equal to 7; and r is any integer in the range defined by (7-(q+s» to 
25 (9-(q+s)> inclusive, provided that r is greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (7c): 

O 



H-N-C-(CH 2 ) n -(NH)-C=0 

I " I " 

I - , CH 2 

1 I 
CH 2 CH 2 

I I 
35 X-(NH)-(CH)-C-Z-(NH)-(CH)-C-OH 

||. . . II 

o o 

(7c) 

whereinX is hydrogen or is any ammo acid or amino acid sequenced is any amino acid sequence consisting 
40 of six amino acids; and n is any integer from 4 to 6 inclusive. 

.In still another embodiment, the invention provides locked helix peptides of formula (8c): 
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10 



O 
II 

H-N-C-(CH2) n -(NH)-C=0 

I J 
CH 2 I 

I i 
CH, CH 2 

I " I 
X-(NH)-(CH)-C-Z-(NH)-(CH)-C-OH 



O O 
(8c) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any ammo acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
15 i n still another embodiment, the invention provides locked helix peptides of formula (9c) : 

O 
II 

H -N-C-(CH 2 ) n -(NH)-C=0 

I I 
20 - CH 2 CH 2 

I I 
CH 2 CH 2 

I ! 

X-(NH)-(CH)-C-Z-(NH)-(CH)-C-OH 
25 II 11 

o o 

(9c) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 3 to 5 inclusive. 
30 In still another embodiment, the invention provides locked helix peptides of formula (10c): 

O 

I! 

H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
35 CH 2 CH 2 

I I 

. X-(NH)-(CH)-C-Z--(NH)-(CH)-C-OH 

II II " ' 

o o 

40 (10c) 

wherein X is hydrogen or is any amino acid or amino acid sequenced is any ammo acid sequence consisting 

of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula ( 11 c): 
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10 



15 



20 



25 



30 



0 =C-(NH)-(CH 2 ) U -C-N-H 



(CH 2 X 
I 



I 

(CH 2 ) V 



X-(NH)-(CH)-C~Z--(NH)-(CH)-C-OH 
II II 

o o 

5 (He) 

whereinX is hydrogen or is any amino acid or amino acid sequenced is any amino acid sequence consisting 
of six amino acids; t is selected from the integers 0 to 6 inclusive, and v is selected from the integers I to 7 
inclusive, provided thatt+v is less than or equal to 7; and u is any integer in the range defined by (7-<t+v)) to 
(9-(t+y)) inclusive, provided that u is greater than 0. 

! In still another embodiment, the invention provides locked helix peptides of formula ( 1 2c): 

O 

II 

0=C-(NH)-(CH 2 ) n -C-N-H 



35 



40 



I 



CH, 



I 

CH 2 
1 

CH-, 



X-(NH)-(CH)-C~Z--(NH)-(CH)-C-OH 

I II I! 

o o 

(12c) 

wherein X js hydrogen or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is ; any integer from 4 to 6 inclusive. 

! In still another embodiment, the invention provides locked helix peptides of formula (13c): 

i i 

i : O 

) i II 

■ 0<:-(NHMCH 2 ) n -C-N-H 

I I 
CH 2 1 



CH 2 r 



I 

CH-, 



X-(NH)-(CH)-C-Z-(NH)-(CH)-C-OH 
II II 

o o 

(13c) 



wherein X is hydrogen or iJany amino acid or amino acid sequenced is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
45 In still another embodiment, the invention provides locked helix peptides of formula ( 1 4c): 
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0=C-(NH)-(CH,) n -C-N-H 

I "I 
5 CH 2 CH 2 

I I 
X-(NH)-(CH>C»Z»(NH)-(CH)-C-OH 

II II 

o o 

10 (14c) 

wherein X is hydrogen or is any amino acid or amino acid sequenced is any amino acid sequence consisting 

of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (15c): 

o 

15 II "" 

OC-(NH>(CH 2 ) n -C-N-H 

I I 
CH-, CH 9 

I " I 

20 CH 2 CH 2 

I I 
X-(NH)-(CH)-C--Z-(NH)-(CH)-C-OH 

II II 

o o 

25 (15c) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 

of six amino acids; and n is any integer from 3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (16c): 

O O 

30 || II 

H-N-C-(CH 2 ) X -C-N-H 

I i 
(CH 2 ) W (CH 2 ) y 

I I 
35 X-(NH)-C-C— Z--N-(CH)-C-OH 

Ml I II 

HO H O 

(16c) . ' " * 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any ammo acid sequence consisting 

40 of six amino acids; w and y are independently selected from the integers 1 to 7 inclusive, provided that w+y 
is less than or equal to 8; and x is any integer in the range defined by (7-(w+y)) to (9-(w+y)) inclusive, 
provided that x is greater than or equal to 0. 

In still another embodiment, the invention provides locked helix peptides of formula (17c): 
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O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I " I 

5 | CH 2 - 

I I 
CH, CH 2 

I " I 
X-(NH>C-C— Z-— N-(CH)-C-OH 

10 I II I II 

HO H O 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
15 In still another embodiment, the invention provides locked helix peptides of formula (1 8c): 

O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I _ I 

20 . CH 2 | 

I I 
CH 2 CH 2 

I I 
X-(NH)-C-C~- Z— N-(CH)-C-OH 

25 I II I II 

HO H O 

(18c) 

wherein X is hydrogen or is any ammo acid or amino acid sequence; Z is any amino acid sequence consisting 
of six amino acids; and n is any integer from 4 to 6 inclusive. 
30 In still another embodiment, the invention provides locked helix peptides of formula ( 1 9c): 

O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I I 
35 CH 2 CH 2 

I ! 

X-(NH)-C-C— -Z— N-(CH)-C-OH 

I II ' I H 
HO H O 

40 (19c) 

wherein X is hydrogen or is any amino acid or amino acid sequence; Z is any amino acid sequence consisttng 

of six amino acids; and n is any integer from 5 to 7 inclusive. 

in still another embodiment, the invention provides locked helix peptides of formula (20c): 
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O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I 1 
5 CH 2 CH 2 ' 

I I 
CH 2 CH 2 



10 



X-(NH)-C-C— -Z--N-(CH)-C-OH 



HO H O 

(20c) 

wherein X is hydrogen or is any amino acid or amino acid sequenced is any amino acid sequence consisting 
of six amino acids; and n is any integer from 3 to 5 inclusive. 
15 Also provided herein are locked helix peptides of formula ( 1 c), formula (2c), formula (3c), formula 

(4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula (10c), formula (lie), 
formula (12c), formula (13c), formula (14c), formula (15c), formula (16c), formula (17c), formula (18c), 
formula (19c) and formula (20c) wherein X contains up to or about 30 amino acid residues. 

Further provided herein are locked helix peptides of formula ( 1 c), formula (2c), formula (3 c), formula 
20 (4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula (10c), formula (lie), 
formula (12c), formula (13c), formula (14c), formula (15c), formula (16c), formula (17c), formula (18c), 
formula (1 9c) and formula (20c) wherein X contains up to or about 25 amino acid residues. 

Additionally provided herein are locked helix peptides of formula ( 1 c), formula (2c), formula (3c), 
formula (4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula (10c), formula 
25 (11c), formula (12c), formula (13c), formula (14c), formula (I5c), formula (16c), formula (17c), formula 
(1 8c), formula (19c) and formula (20c) wherein X contains up to or about 20 amino acid residues. 

Also encompassed herein are locked helix peptides of formula (lc), formula (2c), formula (3c), 
formula (4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula (10c), formula 
(11c), formula (12c), formula (I3c), formula (14c), formula (15c), formula (16c), formula (17c), formula 
30 (18c), formula (19c) and formula (20c) wherein X contains up to or about 15 amino acid residues. 

Further encompassed herein are locked helix peptides of formula (lc), formula (2c), formula (3c), 
formula (4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula (10c), formula 
(1 lc), formula (12c), formula (13c), formula (14c), formula (15c), formula (16c), formula (17c), formula 
( 1 8c), formula ( 1 9c) and formula (20c) wherein X contains up to or about 1 0 amino acid residues. 
35 Additionally encompassed herein are locked helix peptides of formula (lc), formula (2c), formula 

(3c), formula (4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula (10c), 
formula (1 lc), formula (12c), formula (I3c), formula (14c), formula (15c), formula (16c), formula (17c), 
formula (18c), formula (19c) and formula (20c) wherein X contains up to or about 5 amino acid residues. 

Also within the scope of the invention are locked helix peptides of formula (lc), formula (2c), 
40 formula (3c), formula (4c), formula (5c), formula (6c), formula (7c), formula (8c), formula (9c), formula 
(10c), formula (1 lc), formula (12c), formula (13c), formula (14c), formula (15c). formula (16c). formula 
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(1 7c), formula (18c), formula (19c) and formula (20c) wherein X contains up to or about 3 amino acid 
residues. 

The invention also provides locked helix peptides of formula (Id): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 

5 I I. 

(CH 2 ) m (CH 2 ) p 

I I 
(NH,)-(CH)-C— - Z---(NH)-(CH)-C-OH 

II II 
10 o o 

(Id) 

wherein Z is any amino acid sequence consisting of six amino acids; m and p are independently selected from 
the integers 0 to 6 inclusive, provided that m+p is less than or equal to 6; and n is any integer in the range 
defined by (7-(m+p)) to (9-(rn+p)) inc lusive, provided that n is greater than 1 . 
1 5 in another embodiment, the invention provides locked helix peptides of formula (2d): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I I 
| CH, 

I - . I 

20 CH 2 CH 2 

I I 
(NH 2 )-(CH)-C— -Z— - (NHMCH)-C-OH 

II II 

o o 

25 ( 2d ) ■ A <■ fc ■ 

wherein Z is any amino acid sequence consistingof six amino acids; and n is any integer from 4 to 6 inclusive. 

In yet another embodiment, the invention provides locked helix peptides of formula (3d): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 

■ I " " ■ I . ■ ■ - . 

30 CH 2 I 

! ! 

CH 2 CH 2 

I I 
(NH 2 )-(CH)-C Z--(NH)-(CH)-C-OH 

35 "II H 

o o 

(3d) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In still another embodiment, die invention provides locked helix peptides of formula (4d): 
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0=C-(NH)-(CH 7 ) n -(NH)-C=O 

i I 
CH-, CH 2 

I " I 
5 CH 2 " CH 2 

[ i 
(NH 2 )-(CH)-C Z--(NH)-(CH)-C-OH 

II II 

o o 

10 (4d) 

wherein Z is any am ino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (5d): 
0=C-(NH)-(CH 2 ) n -(NH)-C=O 

I I 
15 CH 2 CH 2 

. I I 
(NH 2 )-(CH)-C Z— (NH)-(CH)-C-OH 

II II 

o o 

20 (5d) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (6d): 



25 H-N-C-(CH 2 ) r -(NH)-C=0 

I I 
(CH 2 ) q (CH 2 ) S 

I I 
(NH 2 MCH)-C-Z— (NH)-(CH)-C-OH 

30 " II II 

o o 

(6d) 

wherein Z is any amino acid sequence consisting of six amino acids; q is selected from the integers 1 to 7 
inclusive, and s is selected from the integers 0 to 6 inclusive, provided that q+s is less than or equal to 7; and 
35 r is any integer in the range defined by (7-(q+s)) to (9-(q+s)) inclusive, provided that r is greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (7d): 



-44- 



BNSDOCID: <WO 96a0036A1_L> 



WO 98/20036 PCT/US97/20069 



H-N-C-(CH 2 ) n -(NH)-C=0 

I * I 

| CH 2 



CH 2 CH, 

i I 
(NH 2 )-(CH)-C-Z-(NH)-(CH)-C-OH 



10 



o o 

(7d) . 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (8d): 

15 O 

il 

H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
CH 2 I 



20 



.25 



CH 2 CH, 

S 1 
(NH 2 )-(CH)-C-Z-(NH)-(CH)-C-OH 

I! H 
o o 

(8d) 

wherein Z is any amino acid sequence consisting of six amino acids; andn is any integer from 4 to 6 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (9d): 
O 



30 



H-N-C-(CH 2 ) n -(NH)-C=0 

I I : ; 

CH 2 CH 2 

I I 
35 CH 2 CH 2 

1 I 
(NH 2 )-(CH)-C-Z-(NH>(CH)-C-OH 



40 



. . o o . 

(9d) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula ( I Od): 
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o 

II 

H-N-C-(CH 2 ) n -(NH)-C=0 

I I 

5 cn-y cuo 

I " ! 

(NH 2 )-(CH)-C-Z--(NH)-(CH)-C-OH 
II H 

o o 

10 (lOd) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (lid): 

O 
11 

15 0=C-(NH)-(CHo) u -C-N-H 

I I 
(CH 2 ) t (CH 2 ) V 

I I 
(NH 2 )-(CH)-C-Z--(NH)-(CH)-C-OH 

.20 "II II 

o o 

(lid) 

wherein Z is any amino acid sequence consisting of six amino acids; t is selected from the integers 0 to 6 
inclusive, and v is selected from the integers 1 to 7 inclusive, provided that t+v is less than or equal to 7; and 
25 u is any integer in the range defined by (7-(t+v)) to (9-<t+v)) inclusive, provided that u is greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (12d): 

O 

il 

0=C-(NH)-(CH 2 ) n -C-N-H 
30 |l 

I CH 2 

. I 1 
CH 7 CH 2 

I I 
35 (NH 2 )-(CH)-C--Z--(NH)-(CH)-C-OH 

II II 

o o 

(12d) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
40 In still another embodiment, the invention provides locked helix peptides of formula (13d): 
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o 

11 

0=C-(NH)-(CH 2 ) n -C-N-H 

I ' I 

5 CH 2 r 

I I 

CH 9 CH 2 

! " I 
(NH 2 )-(CH)-C--Z-(NH>(CH)-C-OH 

10 " II II 

o o 

(13d) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (14d): 



15 



20 



o 

II 

0=C-(NH)-(CH 2 ) n -C-N-H 

I " I 

CH 2 CH 2 

I I 
(NH,)-(CH>C-Z-(NH)-(CH)-C-OH 



O O 
(14d) 

25 wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (15d): 

O 

' II ' " 

0=C-CNH)-(CH 2 ) n -C-N-H 

30 I I 

CH 2 7 CH 2 

I I 
CH 2 CH 2 

I I 
35 (NH 2 )-(CH)-C— Z--(NH)-(CH)-C-OH 

II II 

o o 

(15d) . 
wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 

40 In still another embodiment, the invention provides locked heiix peptides of formula ( 1 6d): 
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o o 

I) 11 

H-N-C-(CH 2 ) X -C-N-H 

i I 
5 (CH 2 ) W (CH 2 ) y 

I I 
(NH 2 )-C-C-~Z~ -N-(CH)-C-OH 

II! 1 H 

HO H O 

10 (16d) 

wherein Z is any amino acid sequence consisting of six amino acids; w and y are independently selected from 

the integers 1 to 7 inclusive, provided that w+y is less than or equal to 8; and x is any integer in the range 
defined by (7-(w+y)) to (9-(w+y)) inclusive, provided that x is greater than or equal to 0. 

In still another embodiment, the invention provides locked helix peptides of formula ( 1 7d): 
15 O O 

II IS 

H-N-C-(CH2) n -C-N-H 

I I 
I CH 2 

20 I I 

CH 2 CH 2 

I I 
(NH 2 )-C-C— -Z~ N-(CH)-C-OH 

1 II I II 

25 HO H O 

(17d) 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula ( 1 8d): 



30 



o o 

II II 

H-N-C-(CH 2 ) n -C-N-H 



CH 2 



I I 
35 CH 2 CH 2 

i I 
(NH 2 )-C-C— -Z— - N-(CH)-C-OH , 

mi i ii • - ' . " : 

HO H O 

wherein Z is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive, 
in still another embodiment, the invention provides locked helix peptides of formula { 1 9d): 
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o o 

II il 

H-N-C-(CH 2 ) n -C-N-H 

! I 
5 CH 2 CH-, 

! I 
(NH 2 )-C-C-- -Z— N-(CH)-C-OH 

I II I II 

HO H O 

10 (19d) 

wherein 2 is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 

In stili another embodiment, the invention provides locked helix peptides of formula (20d): 



15 " f H-N-C-(CH 2 ) n -C-N-H 

I I 
CH 2 CH 2 



20 



30 



CH 2 CH 2 



(NH 2 )-C-C--Z— N-(CH)-C-OH 

I II ' I II 
HO H O 

(20d) 

25 wherein Z is any amino acid sequence consisting of six amino acids;and n is any integer from 3 to 5 inclusive, 
i The invention also provides locked helix peptides of formula ( 1 e): 
0=C-(NHHCH 2 ) n -(NH)-C=0 



t (CH 2 ) m I (CH 2 ) p 

I I 

j (NH 2 )-(CH)-C- r -Z— -(NH)-(CH)-C-Y-S 



o ■ o 

■I (le) 
35 wherein S is absent or is a macromolecule; Y is absent, or is hydroxy! if S is absent, or is any amino acid or 

amino acid sequenced is any amino acid sequence consisting of six amino acids; m and p are independently 

selected from the integers 0 to 6 inclusive, provided that m+p is less than or equal to 6; and n is any integer 

in the range defined by (7-{m+p)) to (9-(m+p)) inclusive, provided that n is greater than 1 - 

] In another embodiment, the invention provides locked helix peptides of formula (2e): 
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0=C-(NH)-(CH,) n -(NH)-C=O 

! " I 

I I 
5 CH, CH 2 

I I 
(NH,HCH)-C— Z— (NH>(CH)-C-Y-S 

II 11 

o o 

10 (2e) . 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In yet another embodiment, the invention provides locked helix peptides of formula (3e): 
15 0=C-(NH)-(CH 2 ) n -(NH>C=0 

I I 
CH, I 

I " i 
CH 2 CH 2 

20 I I 

(NH 2 )-(CH)-C Z— (NH)-(CH)-C-Y-S 

11 II 

o o 

(3e) 

25 wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any ammo acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (4e): 

0=C-{NH)-(CH 2 ) n -(NH)-C=0 

30 I I 

CH, CH 2 

I I 
CH 2 CH 2 

I I 
35 (NH 2 )-(CH)-C Z-.~(NH)-(CH)-C-Y-S 

II li 

o o 

(4e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any ammo acid or 
40 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (5e): 
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0=C-(NH)-(CH^) n -CNH>C=0 

I ' I 

CH 2 CH 2 

1 . i 

5 (NH 2 )-(CH)-C Z-~(NH)-(CH)-C-Y-S 

11 !l 

o o 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxy! if S >s absent, or is any amino acid or 
10 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (6e): 
O 

I! ,■..„■• 

15 H-N-C-(CH 2 ) r -(NH)-C=0 

I ~ I 

(CH 2 ) q (CH 2 ) S 

I I 
(NH 2 )-(CH>C-Z— (NH)-(CH)-C-Y-S 

20 " II H 

o o 

(6e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; q is selected from the 
25 integers 1 to 7 inclusive, and s is selected from the integers 0 to 6 inclusive, provided that q+s is less than or 
equal to 7; and r is any integer in the range defined by (7-(q+s)) to (9-(q+s)) inclusive, provided that r is 
greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (7e): 



30 



O 

• II 

H-N-C-(CH 2 ) n -(NH)-<>0 



j CH, 

I. i 
35 CHa CH 2 

I " I - . . 

(NH 2 MCH>C-Z-(NH)-(CH)-C-Y-S 

Si II 
o o 

40 (7e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (8e): 
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O 

I! 

H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
5 CH, | ■ 

I " I 
CH, CH 2 

I " I 
(NH 2 )-(CH)-C--Z»(NHMCH)-C-Y-S 
10 " II II 

o o 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
15 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (9e): 
O 



H-N-C-(CH 2 ) n -(NH)-C=0 



20 



CH 2 CH 2 

I I 
CH 2 CH 2 

I I 
25 (NH 2 )-(CH)-C-Z"(NH)-(CH)-C-Y-S 

il II 

o o 

(9e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 
30 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (tOe): 

O 
II 

35 H-N-C-(CH 2 ) n -(NH>C=0 

1 " 1 

I I 
(NH 2 )-(CH>C~Z--(NH)-(CH)-C-Y-S 

40 " li li 

o o 

(lOe) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any ammo acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
45 5 to 7 inclusive. 
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In still another embodiment, the invention provides locked helix peptides of formula (1 1 e):- 

O 

II 

0=C-(NH)-(CH->) u -C-N-H 

5 I I 

(CH 2 ) t (CH 2 ) V 

I ! 

(NH 2 )-(CH)-C-Z-(NH)-(CH)-C-Y-S 

il II 

10 o o 

(lie) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxy 1 if S is absent, or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; t is selected from the 
integers 0 to 6 inclusive, and v is selected from the integers 1 to 7 inclusive, provided that t+v is less than or 
1 5 equal to 7; and u is any integer in the range defined by (7-(t+v» to <9-(t+v)) inclusive, provided that u is 
greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (12e): 

O 
II 

20 0-C-(NH)-(CH 2 ) n -C-N-H 

I I ■ 

I CH 2 

I I 
CH 2 CH 2 

25 | I 

CNH 2 )-(CH)-C-Z--(NH)-(CH)-C-Y-S 

II II 

o o 

..,.„.,., (12e) 
30 wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any ammo acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula ( 13e): 

o 

35 II 

OC-(NH)-(CH 2 ) n -C-N-H 

I " I 

CH 2 | 

I I 
40 CH 2 CH 2 

! I 

.. (NH 2 )-(CH)-C-Z-(NH)-(CH)-C-Y-S 

II .11 

o o 

45 (13e) 
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10 



wherein S is absent or is a macromolecule: Y is absent; or is hydroxyl if S is absent or is any amino acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (1 4e): 

O 
II 

0=C-(NH)-(CH 2 ) n -C-N-H 

I I 
CH 2 CH 2 

I I 
(NH 2 )-(CH)-C-Z--{NH)-(CH)-C-y-S 



o o 

(14e) . 
1 5 wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (15e): 

O 

20 II 

0=C-(NH)-(CH 2 ) n -C-N-H 

I 1 
CH 2 CH 2 

I I 
25 CH 2 CH 2 

I I 
(NH 2 MCH)-C--Z--(NH)-(CH)-C-Y-S 

II II 

o o 

30 (15e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (16e); 

35 O O 

II II 
H-N-C-(CH 2 ) X -C-N-H 

I " I 

(CH 2 ) W (CH 2 ) y 



40 



(NH 2 )-C-C— Z— -N-(CH)-C-Y-S 

I II I II 

HO H O 
(16e) 
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wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 
amino acid sequenced is any amino acid sequence consisting of six amino acids; w and y are independently 
selected from the integers 1 to 7 inclusive, provided that w+y is less than or equal to 8; and x is any integer 
in the range defined by (7-(w+y)) to <<Mw+y)) inclusive, provided that x is greater than or equal to 0. 
5 In still another embodiment, the invention provides locked helix peptides of formula { 1 7e): 

O O 

SI II 

H-N-C-(CH 2 ) n -C-N-H 

I I 
10 1 CH 2 

I I 
CH 2 CH 2 

I i 
(NH 2 )-C-C— -Z»~N-(CH)-C-Y-S 

15 I II I II 

HO H O 

(17e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxy! if S is absent, or is any amino ac.d or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
20 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula ( 1 8e): 

o o 

II II 

H-N-C-(CH 2 ) n -C-N-H 

25 - t- ... I 

CH 2 I 

I ! 

CH 2 CH 2 
30 (NH 2 )-C-C— Z-~N-(CH)-C-Y-S 

"I „---, • = • • 

HO H O 

(18e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any ammo acid or 
35 amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula ( 19e): 



-55- 



9820036A1J > 



WO 98/20036 



PCTYUS97/20069 



o o 

II II 

H-N-C-(CH 2 ) n -C-N-H 

J I . 

5 CH> CH 2 

I " I 
(NH,)-C-C— Z— -N-(CH)-C-Y-S 

"Ml i II 

HO H O 

10 (19e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any amino acid or 

amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of fonnula (20e): 
O O 



15 



20 



25 



H-N-C-(CH 2 ) n -C-N-H 

I ! 

CH 2 CH 2 

I I 
CH 2 CH 2 

I I 
(NH 2 )-C-C— -Z— -N-(CH)-C-Y-S 



HO H O 

(20e) 

wherein S is absent or is a macromolecule; Y is absent, or is hydroxyl if S is absent, or is any ammo acid or 
amino acid sequence; Z is any amino acid sequence consisting of six amino acids; and n is any integer from 
3 to 5 inclusive. 

30 Also provided herein are locked helix peptides of formula { 1 e), formula (2e), formuia (3e), formula 

(4e), formula (Se), formuia (6e), formula (7e), formula <8e), formuia (9e), formula (lOe), fonnula (lie), 
fonnula (12e), formula (13e), formula (Me), formula (I5e), formula (I6e), formula (!7e), fonnula (18e), 
formula (19e) and fonnula (20e) wherein Y contains up to or about 30 amino acid residues. 

Further provided herein are locked helix peptides of formula (le), formulate), formulate), formula 
35 (4e), formula (5e), formula (6e), formula (7e), formula (8e), formula (9e), fonnula (lOe), formula (1 le), 
formula (12e), formula (13e), fonnula (14e), formula (15e), formula (16e), formula (I7e), formula (18e), 
formula (19e) and fonnula (20e) wherein Y contains up to or about 25 amino acid residues. 

Additionally provided herein are locked helix peptides of fonnula (le), fonnula (2e), formula (3e), 
fonnula (4e), formula (5e), formula (6e), formula (7e), formula (8e), formula (9e), fonnula (lOe), formula 
40 (1 le), formula (12e), fonnula (13e), fonnula (14e), formula (15e), formula (16e), formula (17e), fonnula 
(18e), formula ( 19e) and formula (20e) wherein Y contains up to or about 20 amino acid residues. 

Also encompassed herein are locked helix peptides of formula (le), formula (2e), formula (3e), 
formula (4e), fonnula (5e), formula (6e), formula (7e), formula (8e), formula (9e), formula (lOe), fonnula 
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(He), formula (12e). formula (13e), formula (He), formula (ISe), formula (16e), formula (I7e), formula 
(18e), formula (19e) and formula (20e) wherein Y contains up to or about 15 amino acid residues. 

Further encompassed herein are locked helix peptides of formula (le), formula (2e), formula (3e), 
formula (4e), formula (5e), formula (6e), formula (7e), formula (8e), formula (9e), formula (lOe), formula 

5 (lie), formula (12e), formula (13e) s formula (14e), formula (15e), formula (16e), formula (17e), formuia 
(18e), formula (19e) and formula (20e) wherein Y contains up to or about 10 amino acid residues. 

Additionally encompassed herein are locked helix peptides of formula (le), formula (2e), formula 
(3e), formula (4e), formula (5e), formula (6e), formula (7e), formula (8e), formula (9e), formula (lOe), 
formula (1 le), formula (12e), formula (13e), formula (14e), formula (15e), formula (16e), formula (17e), 

10 formula (18e), formula (19e) and formula (20e) wherein Y contains up to or about 5 amino acid residues. 

Also within the scope of the invention are locked helix peptides of formula (le), formula (2e), 
formula (3e), formula (4e), formula (5e), formula (6e), formula (7e), formula (8e), formula (9e), formula 
(lOe), formula (lie), formula (12e), formula (13e), formula (14e), formula (15e), formula (16e), formula 
(17e), formula (18e), formula (19e) and formula (20e) wherein Y contains up to or about 3 amino acid 

15 residues. 

The invention also provides locked helix peptides of formula (If): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I I 
(CH 2 ) m (CH 2 ) p 



20 



X-(NH)-(CH)-C Z — (NH)-(CH)-C-S 



o o 
(If) 

25 wherein S is hydroxyl or is a macromolecule; X is hydrogen or is any amino acid or ammo acid sequence; Z 
is anyaminoacid sequence consisting of six amino acids; m and p are independentlyselectedfrom the integers 
0 to 6 inclusive, provided that m+p is less than or equal to 6; and n is any integer in the range defined by (7- 
(ih+p)) to (9-(m+p)) inclusive, provided that n is greater than 1 . 

In another embodiment, the invention provides locked helix peptides of formula (2f): 

30 0=C-(NH)-(CH 2 ) n -(NfH)-C=0 

I I 
I CH 2 

I s 

CH 2 CH 2 

35 I I 

X-(NH)-(CH)-C Z— (NH)-(CH)-C-S 

II II 

o o 

(2f) 

40 wherein S is hydroxyl or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
In yet another embodiment, the invention provides locked helix peptides of formula (3f): 
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0=C-(NH)-(CH,) n -(NH)-C=0 

I " I 

CH, I 

I " I 
5 CH, GH 2 

i ' I 
X-(NH)-(CH)-C— -Z— -(NH)-(CH)-C-S 

II il 

o o 

10 (3f) 

wherein S is hydroxy] or is a macromoiecule; X is hydrogen or is any amino acid or amino acid sequence; Z 

is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 

In stiJl another embodiment, the invention provides locked helix peptides of formula (4f): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 

15 | I 

CH, CH 2 

I " I 
CH, CH 2 

I " I 
20 X-(NH)-(CH)-C — -Z-— (NH)-(CH)-C-S 

II . II 

o o 

(4f) 

wherein S is hydroxyl or is a macromoiecule; X is hydrogen or is any ammo acid or ammo acid sequence; Z 
25 is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 

In stil! another embodiment, the invention provides locked helix peptides of formula (5f): 
0=C-(NH)-(CH 2 ) n -(NH)-C=0 



30 



CH, CH, 

r " i 

X-(NH)-(CH)-C Z-~-(NHMCH)-C-S 



o o 

(Sf) 

35 wherein S is hydroxyl or is a macromoiecule; X is hydrogen or is any amino acid or ammo acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (6f): 
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J 

o 

II 

H-N-C-(CH 2 ) r -(NH)-C=0 

I I. 
5 (CH 2 ) q (CH 2 ) S 

X-(NH)-(CH>C-Z— (NH)-(CH)-C-S 

II il 

o o 

10 1 (6f) 

wherein S is hydroxylor is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Z 

is any amino acid sequence consisting of six amino acids; q is selected from the integers 1 to 7 inclusive, and 

s is selected from the integers 0 to 6 inclusive, provided that q+s is less than or equal to 7; and r is any integer 

in the range defined by (7-(q+s)) to (9-(q+s)) inclusive, provided that r is greater than 0. 

1 5 : in still another embodiment, the invention provides locked helix peptides of formula (7f>: 



H-N-C-(CH 2 ) n -(NH)-C=0 

I I I 

20 I CH 2 

I I 
CH 2 CH 2 



25 



X-(NHMCH)-C--Z»(NH)-(CH)-C-S 



o: o 

(7f) 

whereiiiSis hydroxylor is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
30 .• In still another embodiment, the invention provides locked helix peptides of formula <8f): 

O 



i H-N-C-(CH 2 ) n -(NH)-C=0 

I 1 
35 . CH 2 I 

I I 
- j - CH 2 CH 2 

: ' l ' I 

! X-(NH)-(CH)-C-Z-CNH)-(CH>C-S 
40 II II 

o o 

! (8f) 
wherein S is hydroxyl or is a macromolecule; X is hydrogen or is any amino acid or ammo acid sequence; Z 

is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 

45 In still another embodiment, the invention provides locked helix peptides of formula <9f): 
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0 
II 

H-N-C-(CH 2 ) n -(NH)-C>0 

I ~ I. 

CH 2 CH 2 

I i 
CH 2 CH 2 

I I 
X-(NH)-(CH)-C-2~(NH)-(CH).C-S 



o o 

(9f) . ., 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any ammo acid or amino acid sequence; Z 

is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 

15 in still another embodiment, the invention provides locked helix peptides of formula ( 1 Of): 



H-N-C-(CH 2 ) n -(NH)-C=0 

I ' I 

20 CH 2 CH, 

I s 

X-(NH)-(CH)-C-Z-(NH)-(CH)-C-S 
II II 

o o 

25 (10f) 

wherein S is hydroxy! or is a macromolecuie; X is hydrogen or is any ammo acid or ammo acid sequence; L 

is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula ( 11 f): 

O 

30 I! 

0-C-(NH)-(CH 2 ) u -C-N-H 

i " I 

(CH2) t (CH 2 ) V 

I I 
35 X-(NH)-(CH)-C~Z-(NH)-(CH)-C-S 

II I! 
o o 

(llf) 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Z 
40 is any amino acid sequence consisting of six amino acids; t is selected from the integers 0 to 6 inclusive, and 
v is selected from the integers 1 to 7 inclusive, provided that t+v is less than or equal to 7; and u is any integer 
in the range defined by (7-(t+v)) to (9-(t+v)) inclusive, provided that u is greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (12f): 
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10 



25 



0 =C-(NH)-(CH 2 ) n -C-N-H 

i " I 

| CH 2 

i I 
CH 2 CH^ 

| I 
X-(NH)-(CH)-C~Z--(NH)-(CH)-C-S 



O O 
(X2f) 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any amino acid or ammo acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
1 5 In still another embodiment, the invention provides locked helix peptides of formula (1 3f): 

o 
I! 

OC-(NH)-(CH 2 ) n -C-N-H 

I I 
20 CH 2 I 

I 1 
CH 2 CH 2 



X-(NH)-(CH)-C--Z--(NH)-(CH)-C-S 



o o 

(13f) 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
30 In still another embodiment, the "invention provides locked helix peptides of formula ( 1 4f): 

O 
II 

0=C-(NH)-(CH 2 ) n -C-N-H 

1 I 
35 CH 2 CH 2 

i I 
X-(NH)-(CH)-C-Z-(NH)-(CH)-C-S 

o o 

40 04f) 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Z 

is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (150: 
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o 

It 

0=C-(NH)-(CH 2 ) n -C-N-H 

I I 
5 CH, CH 2 

I " i 
CH 9 CH, 

I " I 
X-(NH)-(CH)-G--Z--(NH>(CH)-C-S 

10 II II 

O O 

wherein S is hydroxy! or is a macromolecule; X is hydrogen or is any amino acid or amino acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 
1 5 in still another embodiment, the invention provides locked helix peptides of formula { 1 6f): 

O O 

li II 
H-N-C-(CH 2 ) X -C-N-H 

I " I 

20 (CH 2 ) W (CH 2 ) y 

I I 
X-(NH)-C-C— Z — N-(CH>C-S 

I II I II 

HO H O 

25 (16f> 

wherein S is hydroxyl or is a macromolecule; X is hydrogen or is any amino acid or ammo acid sequence; Z 

is any amino acid sequence consisting of six amino acids; w and y are independently selected from the integers 

1 to 7 inclusive, provided that w+y is less than or equal to 8; and x is any integer in the range defined by (7- 

(w+y)) to (9-(w+y)) inclusive, provided that x is greater than or equal to 0. 

30 In still another embodiment, the invention provides locked helix peptides of formula (17f): 

o o 

II II 

H-N-C-(CH 2 ) n -C-N-H 

I " I 

35 | CH 2 

I ! 

CH, CH 2 

I " I 
X-(NH)-C-C — Z~~N-(CH>C-S 

40 . . . | II I II 

HO H O 

(17f) 

wherein S is hydroxyl or is a macromolecule; X is hydrogen or is any amino acid or ammo acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
45 in still another embodiment, the invention provides locked helix peptides of formula <18f): 
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O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I I 
5 CH, | ' 

1 I 
CH, CH 2 

I " I 
X-(NH)-C-C— -Z— -N-(CH)-C-S 

io I II i II 

HO H O 

(18f) 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Z 
is any amino acid sequence consisting of six amino acids; and n is any integer from 4 to 6 inclusive. 
15 In still another embodiment, the invention provides locked helix peptides of formula (19f): 

O O 

II I! 
H-N-C-(CH 2 ) n -C-N-H 

i ! 
20 CH 2 CH 2 

I I 
X-(NH)-C-C— Z— -N-(CH)-C-S 

Ml I II 

HO H O 

25 < 19f > 

wherein S is hydroxyl or is a macromolecuie; X is hydrogen or is any amino acid or amino acid sequence; Z 

is any amino acid sequence consisting of six amino acids; and n is any integer from 5 to 7 inclusive. 
In still another embodiment, the invention provides locked helix peptides of formula (20f): 
O O 

30 II II 

H-N-C-(CH 2 ) n -C-N-H 

I I 
CH 2 CH 2 

I ! 

35 CH 2 CH 2 

I I 
X-(NH)-C-C— Z— N-(CH)-C-S 

I II I • • II 

HO H O 

40 (20f) 

wherein S is hydroxy! or is a macromolecuie; X is hydrogen or is any amino acid or ammo acid sequence; Z, 

is any amino acid sequence consisting of six amino acids; and n is any integer from 3 to 5 inclusive. 

Also provided herein are locked helix peptides of formula (If), formula (20, formula (30, formula 

(4£), formula(50, formula (60, formula (70, formula(Sf), formula (90, formula (100, formula (1 10, formula 
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(120, formula(i3f),formula(14f), formula ( 1 50, formula ( 1 6f), formula (17ft, formula ( 1 8f) t formula (.190 
and formula (200 wherein X contains up to or about 30 amino acid residues. 

Further provided herein are locked helix peptides of formula ( 1 f), formula (20, formula (30, formula 
(40, formula (50, formula (60, formd formula (10f), formula(1 10, formula 

5 (120, formula(130 ) fbrmula(140,Tormuia(l 50, formula (160, formula (170, formula (180, formula (190 
and formula (200 wherein X contains up to or about 25 amino acid residues. 

Additionally provided herein are locked helix peptides of formula (10, formula (20, formula (30, 
formula (4 0, formula (5 0, formula (60, formula (70, formula (SO* formula (90, formula (100, formula (110, 
formula (120, formula (130, formula (140, formula (150, formula (160, formula (170, formula (ISO, 
10 formula (1 90 and formula (200 wherein X contains up to or about 20 amino acid residues. 

Also encompassed herein are locked helix peptides of formula (10, formula (20, formula (30, 
formula (40, formuia (5 0, formula (60, formula (70, formula (80, formula (90, formula ( 1 00, formula (lift 
formula (120, formula (130, formula (140, formula (150, formula (160, formula (170, formula (180, 
formula (190 and formula (200 wherein X contains up to or about 15 amino acid residues. 
15 Further encompassed herein are locked helix peptides of formula (10, formula (20, formula (30, 

formula (40, formula (50, formula (60, formula (70, formula (80, formula (90, formula (100, formula(l 10, 
formula (120, formula (130, formula (140, formula (150, formula (160, formula (170, formula (180, 
formula (190 and formula (200 wherein X contains up to or about 10 amino acid residues. 

Additionally encompassed herein are locked helix peptides of formula (10, formula (20, formula 
20 (30, formula(40, formula (50, formula(60, formuia (70, formula (80, formula (90, formula (100, formula 
(1 10,formuia(120,formula(130, formula(l40, formula(150, formula(160, formula (170, formula (180, 
formula (190 and formula (200 wherein X contains up to or about 5 amino acid residues. 

Also within the scope of the invention are locked helix peptides of formuia (10, formuia(20, formula 
(30, formula(40, formula(50, formula(60, formula (70, formula (80, formula (90, formula (100, formula 
25 (110, formuia(120,formula(130, formula (Hf), formula(I50, formula(160, formula (170, formula (180, 
formula (190 and formula (200 wherein X contains up to or about 3 amino acid residues. 
The invention also provides locked helix peptides of formula (lg): 
0=C-(NHMCH 2 ) n -(NH)~C=0 

I ~ I 

30 (CH 2 ) m (CH 2 ) p 

I I 
(NH 2 )-(CH)-C— (NHMCH)-C-S 

if I! 
o o 

35 (lg) 

wherein S is hydroxylor is a macromolecuie; Z is any amino acid sequence consisting of six amino acids; m 

and p are independently selected from the integers 0 to 6 inclusive, provided that m+p is less than or equal to 
6; and n is any integer in the range defined by (7-(m+p)) to (9-(m+p)) inclusive, provided that n is greater than 
L 

40 In another embodiment, the invention provides locked helix peptides of formula (2g): 
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0=C-(NH>(CH-,) n -(NH)-C=0 

I " I 

I CH 2 

I I 
5 CH 2 " CH 2 

I I 
(NH 2 )-(CH)-C Z— -(NH)-(CH)-C-S 

II II 

o o 

10 (2g) . . , . . 

wherein S is hydroxy! or is a macromolecule;Z is any amino acid sequence consisting of six amino acids; and 

n is any integer from 4 to 6 inclusive. 

In yet another embodiment, the invention provides locked helix peptides of formula (3g): 
0=C-(NH)-(CH 2 ) n -(NH)-C=0 

15 I I 

CH 2 I 

S I 
CH 2 CH 2 

I I 
20 (NH 2 HCH)-C— -Z-— (NH)-(CH)-C-S 

II II 

o o 

(3g) , • - - . J 

wherein S is hydroxyl or is a macromolecule;Z is any amino acid sequence consisting of six amino acids; and 
25 n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (4g): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I * i 

CH 2 CH 2 

30 | I 

• CH 2 CH 2 . 

I I 
(NH 2 )-(CH)-C Z— (NH)-(CH)-C-S 

II II 
35 O Q 

, ■ • . . ■ 

wherein S is hydroxyl or is a macromolecule;Z is any amino acid sequence consisting of six amino acids; and 
n is any integer from 3 to 5 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (5g): 
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0=C-(NH)-(CH 2 ) n -(NH)-C=0 

I "I 
CH, CH 2 

I " I 
5 (NH 2 )-(CH)-C — -Z — (NH)KCH)-C-S 

li II 

o o 

(5g) 

wherein S is hydroxy! or is a macromotecule;Z is any amino acid sequence consisting of six amino acids; and 
10 n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (6g): 

O 



15 



H-N-C-(CH 2 ) r -(NH)-C-0 

i I 
(CH 2 ) q (CH 2 ) S 

I I 
(NH,)-(CH)-C~Z— (NH)-(CH)-C-S 

II II 
20 O O 

(6g) 

wherein S is hydroxyl or is a macromolecule; Z is any amino acid sequence consisting of six amino acids; q 
is selected from the integers 1 to 7 inclusive, and s is selected from the integers 0 to 6 inclusive, provided that 
q+s is less than or equal to 7; and r is any integer in the range defined by (7-(q+s)) to (9-(q+s)) inclusive, 
25 provided that r is greater than 0. 

In still another embodiment, the invention provides locked helix peptides of formula (7g): 

O 



H-N-C-(CH 2 ) n -(NH)-C=0 



30 



| CH, 

I I 
CH 2 CH, 

.1 I 
35 (NH 2 MCH)-C-Z-(NH)-<CH)-C-S 

II II 

o o 

(7g) ■ „ 

wherein S is hydroxyl or is a macromolecuIe;Z is any amino acid sequence consisting of six amino acids; and 

40 n is any integer from 4 to 6 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (8g): 
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10 



o 

II 

H-N-C-(CH 2 ) n -(NH)-C=0 

I " " I 

CH, |* 

I I 
CH 2 CH 2 

I 1 
{NH 2 )-(CH)-C-Z--(NH)-(CH)-C-S 



O O 

wherein S is hydroxyl or is a macromo!ecuie;Z is any amino acid sequence consisting of six amino acids; and 
n is any integer from 4 to 6 inclusive. 
15 In still another embodiment, the invention provides locked helix peptides of formula (9g): 

O 
II 

H-N-C-(CH 2 ) n -(NH)-C=0 

I I 
20 CH 2 CH 2 

I I 
CH 2 CH 2 

I I 
(NH 2 >(CH)-C--Z--(NH)-(CH)-C-S 

25 " || II 

o o 

<9g) 

wherein S is hydroxyl or is a macromolecu ie;Z is any am ino acid sequence consistmg of six amino acids; and 
n is any integer from 3 to 5 inclusive. 
30 In still another embodiment, the invention provides locked helix peptides of formula < 1 Og): 

O 

II 

H-N-C-(CH 2 ) n -(NH>C=0 

I I 
35 CH 2 CH 2 

I I 
(NH2)-(CH)-C-Z«(NH)-(CH)-C-S 

II II 

o o 

40 (lOg) 

wherein S is hydroxyl or is a macromoIecule;Z is any amino acid sequence consisting of six amino acids; and 

n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula ( 1 1 g): 
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O 
II 

0=C-(NH)-(CH,) u -C-N-H 

I I 
5 (CH,) t (CH 2 ) V 

I " I 
(NH 0 )-(CH)-C--Z»(NHMCH)-C-S 

II II 

o o 

10 (llg) 

wherein S is hydroxy! or is a macromolecule; Z is any amino acid sequence consisting of six amino acids; t 
is selected from the integers 0 to 6 inclusive, and v is selected from the integers 1 to 7 inclusive, provided that 
t+v is less than or equal to 7; and u is any integer in the range defined by (7-(t+v)) to (9-(t+v)) inclusive, 
provided that u is greater than 0. 
1 5 In still another embodiment, the in vention provides locked helix peptides of formula ( 12g): 

o 



0=C-(NH>(CH 2 ) n -C-N-H 

I I 
20 | CH 2 

I 1 
CH 2 CH 2 

I I 
(NH,)-(CH>C"Z--(NHMCH)-C-S 
25 || II 

o o 

(12g) 

wherein S is hydroxyl or is a macromolecule;Z is any amino acid sequence consisting of six ammo acids; and 
n is any integer from 4 to 6 inclusive. 
30 In still another embodiment, the invention provides locked helix peptides of formula ( 13g): 

o 

. II 

0=C-<NHMCH 2 ) n -C-N-H 

S I 

35 CH 2 I 

I I 

CH 2 CH 2 ..... ; 

• - ' I I 

(NH 2 )-(CH)-C-Z-(NH)-(CH)-C-S 

40 " || li 

o o 

(13g) 

wherein S is hydroxyl or is a macromolecule;Z is any amino acid sequence consisting of six amino acids; and 
n is any integer from 4 to 6 inclusive. 
45 In still another embodiment, the invention provides locked helix peptides of formula (14g): 
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o 



0=C-(NH)-(CH 2 ) n -C-N-H 

I I 
CH 2 CH 2 

I I 
(NH 2 >(CH)-C--Z-(NH)-(CH)-C-S 

I! ii 
o o 

wherein S is hydroxylor is a macromolecule;2 is any amino acid sequence consisting of six amino acids; and 
n is any integer from 5 to 7 inclusive. 

In still another embodiment, the invention provides locked helix peptides of formula (15g): 

O 

• ! li 

0=C-(NH)-(CH 2 ) n -C-N-H 

I i 
CH 2 CH-, 

■ I I 

20 : CH 2 ; CH 2 

I . I 
(NH 2 )-(CH>C-Z-(NH)-(CH)-C-S 

li I! 
O o 

25 ( 15 g) 

wherein S is hydroxyl or is a macromolecu)e;Z is any amino acid sequence consisting of six amino acids; and 

n is any integer from 3 to 5 inclusive. 

i In still another embodiment, the invention provides locked helix peptides of formula (I6g): 

i o I o 

30 1 11 i II 

•! H-N-C-(GH 2 ) X -C-N-H 

I I 1 I 

; (CH^ (CH 2 ) y 

! | i I 

35 (NH 2 )-C-C— Z— N-(CH)-C-S 

I II I II 

HO H O 

• ! (16g) . . 

wherein S is hydroxylor is a'macromolecule; Z is any amino acid sequence consisting of six ammo acids; w 

40 and y are independentlyselected from the integers 1 to 7 inclusive, provided that w+y is less than or equal to 

8; and x is any integer in the range defined by (7-(w+y))to (9-(w+y)) inclusive, provided that x is greater than 

or equal to 0. ; . 

' In still another embodiment, the invention provides locked helix peptides of formula ( 1 7g): 
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O O 

I! II 
H-N-C-(CH 2 ) n -C-N-H 

I I 
I CH : 



CH, CH 2 

I " I 
(NH 2 )-C-C — Z-— N-(CH)-C-S 

10 I II I II 

HO H O 

wherein S is hydroxy! or is a macromolecule;Z is any amino acid sequence consisting of six ammo acids; and 
n is any integer from 4 to 6 inclusive. 
1 5 in still another embodiment, the invention provides locked helix peptides of formula ( 1 8g): 

O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

I ' \ 

20 CH 2 I 

I I 

CH 2 CH 2 

I 1 
(NH 2 )-C-C--~ Z-~N-(CH>C-S 

25 E II I II 

HO H O 

(18g) ...... .„ , 

wherein S is hydroxylor is a macromolecule;Z is any amino acid sequence consisting of six ammo acids; and 

n is any integer from 4 to 6 inclusive. 
30 In still another embodiment, the invention provides locked helix peptides of formula (19g): 

O O 

II II 
H-N-C-(CH 2 ) n -C-N-H 

i i 
35 CH 2 CH 2 

I I 
(NH 2 )-C-C — Z-— N-(CH)-C-S 

i 11 I II ' ' 

HO HO 

40 (19g) -j . 

wherein S is hydroxyl or is a macromolecule;Z is any amino acid sequence consisting of six amino acids; and 

n is any integer from 5 to 7 inclusive. 

. In still another embodiment, the invention provides locked helix peptides of formula (20g): 



-70- 



S820036MJ _> 



WO 98/20036 



PCT/US97/20069 



O O 

I! II 

H-N-C-(CH 2 ) n -C-N-H 

i I 
CH 2 CH 2 " 

I I 
CH 2 CH 2 

! S 
(NH 2 )-C-C~~Z— N-(CH)-C-S 



10 



HO H O 

wherein S is hydroxyl or is a macromolecule;Z is any amino acid sequence consistingof six amino acids; and 
n is any integer from 3 to 5 inclusive. 
1 5 For locked helix peptides of formulas (I), (2), (3), (4), (5), (6), (7), (8), (9), ( 10), < 1 1 ), (12), (13), 

(14),(15),(16),<17),(18),(19),(20),(le),(2e),^^ 

(14e), (lSe), (16e), (17e), (18e), (19e), (20e), (If), (2f), (3f), (40, (5f), (6f), (7f), (8f), (9f), (10ft 0 1ft (12ft 
(13ft (14ft (15ft (16ft (17ft (18ft (19ft (20ft (lg), (2g), (3g), (4g), (5g), (6g), (7g), (8g), (9g), (lOg), 
(Ug),{12g),(13g),(14g),(15gX(16g),(17g),(18g),(19g), or (20g) bound to a mactomolecule, the invention 
20 encompasses any macromolecule capable of serving as an anchor for the C-terminus of the locked helix 
peptide. Typically, the macromolecule fiinctions as a solid support. In general, the solid support is an inert 
matrix, such as a polymeric gel, comprising a three dimensional structure, lattice or network of a material. 
Almost any macromolecule, synthetic or natural, can form a gel in a suitable liquid when suitably cross-linked 
with a difunctionalreagent. In one embodiment, the macromolecule selected is convenient for use in affinity 
25 chromatography. Most chromatographic matrices used for affinity chromatography are xerogels. Such gels 
shrink on drying to a compact solid comprising only the gel matrix. When the dried xerogel is resuspended 
in the liquid, the gel matrix imbibes liquid, swells and returns to the gel state, Xerogels suitable for use herein 
include polymeric gels, such as cellulose, cross-linkeddextrans(e.g. Sepharose), agarose, cross-linkedagarose, 
polyacrylamide gels, and poiyacrylamide-agarose gels. 
30 The locked helix peptides provided herein can be constructed according to the methods of the 

invention described in Sections II and III below. 

In one embodiment, the peptides of the invention are designed to isolate the binding determinants 
from a-helical binding domains of known proteins. Such peptides have a number of uses, including the 
determination of whethera binding determinant in an a-helical binding domain of a known protein can serve 
35 as a structural model for the design of peptidomimetics or small molecules capable of mimicking or 
antagonizingthe binding activity of the intact protein. In using the peptides of the invention for this purpose, 
the practitioner selects a binding protein with a helical domain that interacts with ligand, and then identifies 
a candidate binding determinant situated within a sequence of six (or more) contiguous amino acids in the 
helical binding domain. The candidate binding determinant can be identified by using alanine scanning 
40 mutagenesisto determine whether the candidate sequence contains one or more amino acid residues that are 
critical for ligand binding. Next, a constrained peptide containing the candidate sequence is designed by 
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selecting two residues in the candidate sequence (designated/ and/+7) which are separated by an intervening 
sequence of six amino acids and which do not interact with ligand (as determined by alanine scanning 
mutagenesis in the previous step) for substitution with amino acid residues having a side chain containing an 
amide bond-forming substitutent. The peptide is synthesized and the side chain amide bond-forming 
5 substitutent of the foreign / and /+7 residues are used to tether the peptide in an a-helical conformation 
according to the methods of the invention described in Section II below. Finally, the locked helix peptide's 
binding activity with the ligand is assayed, e.g., in a binding competition assay with the intact binding protein, 
and the results of the assay can be used to determine whether a peptidomimetic or small molecule antagonist 
could be developed using the binding determinant as a structural model. 
10 In another embodiment the lockedhelix peptides of the invention are used to replace intact binding 

proteins or protein binding domains in the affinity purification of ligand. For example, Protein A is commonly 
used for affinity chromatographicpurification of IgG molecules. The Z-domain of Protein A is a three helix 
bundle, 59 residues in length, which binds. to the Fc portion of IgG. As described in Example 2 below, a 
locked helix species of the peptide Phe-Asn-Met-(l)-GIn-Gln^Arg-Arg-Phe-Tyr-(2>Ala-Leu-His (wherein 
15 the amino acid residues at the (1) and (2) positions in the corresponding Z-domain sequence are both replaced 
with glutamic acid residues), corresponding to a binding determinant in helix 1 of the Z-domain can be used 
to bind IgG. Accordingly, the invention provides constrained helix species containing binding determinants 
in helix 1 of Hie Z-domain in Protein A, including molecules of formula (4) above wherein Z is Gln-Gln-Arg- 
Arg-Phe-Tyr. In one embodiment, the constrained helix species is a molecule of formula (4) wherein Z isGln- 
20 Gln-Arg-Arg-Phe-Tyr,X is Phe-Asn-Met,and Y is Ala-Leu-His. The IgG binding molecules of the invention 
are conveniently synthesized using the solid phase peptide synthesis methods described in Section II below, 
such that the molecules are anchored to resin beads suitable for column or batch affinity chromatography. 

In still anotherembodimenMhe lockedhelix peptides of the invention are designed to mimic epitopes 
in proteins and are used to selectively raise polyclonal or monoclonal antibodies against such individual 
25 epitopes. Since the peptides will frequently be too small to generate an immune response, the peptides can 
be conjugated to carriers known to be immunogenic in the species to be immunized, e.g., keyhole limpet 
hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a Afunctional or 
derivatizing agent, for example, maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine 
residues), N-hydroxysuccinimide(throughlystneresidues) J glutaraldehyde,succinic anhydride, SOCl 2 , or R 1 ^ 
30 - C = NR, where R and R 1 are different alkyl groups. 

The locked helix peptides of the invention are particularly useful in isolating synthetic antibody 
clones with a selected binding activity from phage display combinatorial libraries. The locked helix peptide 
provides a significant advantage over the intact protein or protein domain in that using the locked helix peptide 
allows the isolation of binding activities for the particular conformational epitope of interest. Without the 
35 locked helix peptides of the invention, the conformation epitope of interest would likely require structural 
support from other regions of the protein or protein domain whose presence in the ligand would result in the 
concomitant isolation of undesired clones. In addition, the synthesis of locked helix peptides anchored to 
polymeric resins as described in Sections II and III below would provide material that can be conveniently 
packed into columns for panning phage display libraries. 
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In another aspect, the locked helix peptides of the invention are used to provide confonriationaliy 
stable variantsof peptides or proteins which exhibit "floppy" or unstable a-heiical secondary structure at one 
or more site(s) in unrestrained form under conditions of interest. In particular, the methods of the invention 
can solve problems presented by some antigens which relate to the instability of conformational epitopes, A 

5 conformationalepitope can fail to present the desired antigenic determinant because of "floppy" or unstable 
a-helical secondary structure elements) in the epitope. The restraint of such "floppy" a-hehcal structure(s) 
according to the methods of the invention would stabilize the conformational epitope and allow presentation 
of the desired antigenic determinant. This application of the present methods and peptides is particularly 
useful, for example, in vaccine design and in generating polyclonal or monoclonal antibodies from host 

10 animals or isolating antibody clones from phage display libraries. 

In one embodimentthe invention, where the lockedhelix peptidesof the invention are used to provide 
conformational stable variants of peptidesor proteins which exhibit "floppy" or unstable a-heiical secondary 
structure at one or more sites in unrestrained form under conditions of interest, a compound containing a 
constrained helical peptide that is useful as an immunogen, vaccine and diagnostic for human 

15 immunodeficiency virus (HTV) is provided. Acquired immunodeficiency syndrome (AIDS) is caused by a 
retrovirus identified as the human immunodeficiency virus (HIV). There have been intense efforts to develop 
a vaccine that induces a protective immune response based on induction of antibodies or cellular responses. 
Recent efforts have used subunit vaccines where an HIV protein, rather than attenuated or killed virus, is used 
as the immunogen in the vaccine for safety reasons. 

20 The human immunodeficiency virus 1 (HIV- 1 ) envelope glycoproteins gp 1 20 and gp4 1 mediate viral 

tropism to and subsequententry into target cells (Freed et aL, The Journal of Biological Chemistry 270, 23883- 
23886 (1995)). The role of gpl20 is to bind to target cells by interactions with CD4 and one of several co- 
recepto^ 

and cellular membranes. The mechanism by which gp41 mediates membrane fusion has recently been the 
25 subject of intensive study. Evidence suggests that the process may involve the formation of a coiled-coil 
trirner, which is thought to drive the transition from resting to fusogenic states, as modeled for influenza 
hemagglutinin (Wilson et aL, Nature 289, 366-373 (1981); Carr, et aL Cell 73, 823-832 (1993); Bullough 
et at., Nature 371, 37-43 (1 994)). 

Two linear peptides derived from HIV-1 gp41 have been found to inhibit viral fusion. The first of 
30 these, DP 107, represents a portion of gp4 1 near the N-terminai fusion peptide and has been shown to be helical 
in solution and oligomerize in a manner consistent with coiled-coii formation (Gailaher et aL, AIDS Res. Hum. 
Retrovir uses 5, 431^0 426-430(1997)). A more potent peptide, 

DP17S ? was derived from the C-terrninal region of the gp41 ectodomain (Wild, et aL, PNAS 91 : 9770-9774 
(1994); Jiang et aL, Nature, 365: 113 (1993)). Although this region of gp41 was predicted to be ^-helical 
35 (GailaheretaL,,4/DS/tes. Hum. Retroviruses 5, 431-440 (1989)), DP178 itself lacks discernable structure in 
solution (Wild, etal , PNAS 91: 9770-9774(1994). Attempts to explore the mechanism of action of DP 178 
have been complicated by a lack of understanding of its bioactive conformation. Recently, crystailographic 
(Chan et aL, Cell 89, 263-273 (1997); Weissenhorn et al., Nature 387, 426-430 (1997)) and solution 
(Lawless, et aL, Biochemistry 35, 13697-13708 (1996); Lu et aL, Nature Structural Biology ■% 1075-1082 

-73- 



BNSOOCID: <WO Sfi20036A1_L> 



WO 98/20036 



PCT/US97/20069 



(.1995); Rabenstein et al., Biochemistry 35, 13922-13928 (1996)) studies have shown that disconnected 
segments of HIV- 1 gp4 1 that overlap DP 1 07 and DP 1 7S associate in a tightly-packed helical bundle. The C- 
terminal segment, corresponding to DP178, forms an extended helix which packs in an antiparallel fashion 
against a groove created by an N-terminal (DP-1 07) coiied-coil trimer. While these data suggest one possible 
5 conformation for DP 178, they do not provide conclusive information about the mechanism of peptide 
inhibition during viral fusion events. 

The present invention provides helical constrained forms of DP 1 7 8 and homologous sequences and 
variants, overcoming the limitations in the art concerning DPI 7 8 and providingmore effective use of DP-178 
like sequences. Accordingly, in one embodiment of the invention is provided a constrained helical peptide 
1 0 having at least its internal amino acid sequence (and preferably adjacent amino acid sequences) selected from 
the C-terminal region of the HIV- 1 LAI isolate transmembrane protein gp4 1 ectodomain amino acids 633 to 
678, which overlap with the sequence corresponding to peptide DP- 178 (amino acid residues 643 to 673). 
This region is a 46-amino acid sequence (reading from the amino to carboxy terminus): NH2- 
WMEWEREIDNYTSLIHSL1EESQNQQEKNEQELLELDKWASLWNWF-COOH., 
15 Peptides in an alpha-helical coiled-coii conformation interact with one another in a characteristic 

manner that is determined by the primary sequence of each peptide. The tertiary structure of an alpha-helix 
is such that 7 am ino acid residues in the primary sequence correspond to approximately 2 turns of the alpha-, 
helix. Accordingly, a primary amino acid sequence giving rise to an alpha-helical conformation may be 
broken down into units of 7 residues each, termed heptads (having the form abcdefg). The core polypeptides 
20 are comprised of a series of heptads in tandem. When the sequence of a heptad is repeated in a particular core 
polypeptide, the heptad may be referred to as a "heptad repeat", or simply "repeat". 

According to the invention, embodiments are providedas compounds containing a constrainedhelical 
peptide that is composed of a peptide which contains a sequence of eight amino acid residues, where the 
sequence of eight amino acid residues has a first terminal residue and a second terminal residue, where the first 
25 terminal residue and the second terminal residue flank an internal sequence of six amino acids, wherein the 
first and second terminal residues have a side chain that are linked to each other forming a locking moiety to 
constrain the peptide to a helical form . The internal sequence of six amino acids has the form gabcde, defgab, 
or cdefga and has a sequences of six contiguousamino acids found in HIV- 1 LAI strain transmembrane protein 
gp41 amino acid sequence 633 to 678, in its homolog sequence from another HIV strain, in a consensus 
* 30 sequence of its homolog sequences from any one HIV clade, or amino acid substituted variant thereof 
According to the invention, each of the amino acids in the aforementioned sequences is assigned a position 
of a»b,c,d t e,f,org. The assignment is based on assigning the amino acid 633 ofthe HIVLAIgp41 633-678 
sequence to position a of a repeating abcdefg heptad assignment. Subsequent amino acids in the sequence 
~" are assigned positions accordingly. Figure 1 8 indicates the heptad positional assignment of each amino acid 

- ~ 35 in the sequence. The assignment can be readily applied to homologs and consensus sequences by aligning 
their amino acids to the corresponding amino acid in the representative HIV LAI sequence. The 633 amino 
acid or its corresponding amino acid in a homolog or consensus sequence is assigned position a, which begins 
the repeating abcdefg assignment pattern, 
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In these representative compounds and sequences shown in Figures 1 6- 1 8, the locking moiety or 
tether is between adjacent f positions when the internal sequence is of the form gabcde, adjacent c positions 
when the internal sequence is of the form defgab, or adjacent b positions when the internal sequence is of the 
form cdefga. In the most preferred embodiments the locking occurs between adjacent f positions, in which 
5 case the f position amino acids are replaced by amino acids suitable for providing a helix locking moiety. 
Figure 18 provides the alignment of the repeating abcdefg assignment with the sequences relevant to the 
invention. In a preferred embodiment the internal sequence of six amino acids has the form gabcde. These 
"internal sequence" of six amino acids from gp41 can substitute for moiety "Z" in any of the compounds, 
formulas, and synthetic methods taught herein. 
1 0 While the internal amino acid sequence is preferably from a sequence of six contiguous amino acids 

in HIV- 1 LAI strain gp4 1 amino acid sequence 633 to 678, in its homolog sequence from another HIV strain, 
or in a consensus sequence of its homolog sequences from any one HIV clade, it may be an amino acid 
substitutedvariantthereof. The sequences of the invention also include analogs of HIV gp4 1 sequence 633- 
678, truncations which may include, but are not limited to, peptides comprising the 633-678 sequence, 
15 containing one or more amino acid substations, insertions and/or deletions. The analogs of the sequencewill 
exhibit antiviral activity when in constrained peptides of the invention, and may, further, possess additional 
advantageousfeatures, such as, for example, increased bioavailability, and/or stability, or generate antibodies 
with increased HIV strain recognition. 

HIV- 1 and HIV-2 envelope proteins are structurally distinct, but there exists a striking amino acid 
20 conservation within the gp41 633-678 corresponding regions of HIV-1 and HIV-2. Amino acid 
substitutionsmay be of a conserved or non-conserved nature. Conserved amino acid substitutions consist of 
replacingone ormoreamino acidsof the 633-678 peptide sequence with amino acids of similar charge, size, 
and/or hydrophobicity characteristics, such as, for example, a glutamic acid (E) to aspartic acid (D) amino acid 
- substitution. Non-conservedsubstitutionsconsistof replacingone or more amino acids of the 633-678 peptide 
25 sequence with amino acids possessing dissimilar charge, size, and/or hydrophobicity characteristics, such as, 
for example, a glutamic acid (E) to valine (V) substitution^ 
i Deletions of the 633-678 region or its homologs are also within the scope of the invention. Such 

deletions consist of the removal of one or more amino acids with the lower limit length of the resulting peptide 
sequence being 6 amino acids for use as an internal sequence of a constrained helical peptide. Preferably the 
30 deletions retain sufficient amino acids such that at least two locks may be incorporated as taught herein. 
Examples of such deletions are the HIV35 peptide and its constrained helix compounds of the invention that 
have one lock (e.g. HIV 24) and two locks (e g. HIV 31). Most preferably, the deletions are terminal 
truncations, but in any event result in peptides which, when constrained along the f-b-c helical face, are still 
recognized by the coiled coil search algorithms used herein, or alternatively, retain the a-d helical face 
35 orientation and spatial arrangement of the parent molecule, or alternatively, can exhibit antifusogenic or 
antiviral activity. 

Most preferred compounds are those that, when used as immunogens, generate antibodies that 
neutralize HIV viral fusogenic activity or infectivity. 
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The peptidesof the invention may further include homolog sequences of the HIV LAI strain 633-678 
sequence which exhibit antivirai activity when in constrained helical form. Most preferably, the constrained 
peptides, when used as haptens, will generateantibodiesthat block viral fusion events, leading to an inhibition 
of viral infectivity. Such homologs are peptideswhose amino acid sequences are comprised of the amino acid 

5 sequences of peptide regions of other (i.e., other than HIV- 1 LAI) viruses that correspond to the gp41 peptide 
region of 633-678. Such viruses may include, but are not limited to, other HIV- 1 isolates and HIV-2 isolates. 
Homologs derived from the correspondinggp41 peptide region of other (i.e., non HIV-1LAI) HIV-1 isolates 
may include those provide in Figures 16A to 16G, or other known corresponding sequences. Particularly 
preferredare those derived from HIV- 1 SF2, HIV- 1 RF, and HIV-1 MM, GNE6, ONES, and Thai strain isolate 

10 A244. 

In a particularlypreferred embodiment, amino acids at positions a and d of the internal sequence of 
six amino acids are not amino acid substituted in the helical peptide, but are the amino acids in the known 
isolates or consensus sequences (see Figures 16A-16G and 17). 

Also most preferred are embodiments where die amino acids at positions g and e of the internal 

15 sequenceof six amino acids are not amino acid substituted in the helical peptide. An amino acid at any one 
of positions a, d, g, or e of the internal sequence of six amino acids can be conservatively substituted in the 
helical peptide in preferred embodiments. The a and d positions, and less directly the g and e positions, are 
believed to be those that are on the face of the constrained helix that interacts with the gp4 1 core trimer (see 
Figure 1 9). Since positions f, b and c are believed to not directly participate in binding, but rather serve to 

20 allow helix structure, preferred variations at positions b and c of the internal sequence of six amino acids are 
not amino acid substituted in the helical peptide, when not the locking (tethering) residues. Less preferred are 
compounds wherein an amino acid at any one of positions b, c, or f of the internal sequence of six amino acids 
is conservatively substituted or is any non-helix-breaking amino acid in the helical peptide, that does not 
interfere with the locking moiety. Preferred internal sequence chimeras are those in which an amino acid at 

25 any one of positionsa, d, g, or e of the internal sequence of six amino acids is substituted in the helical peptide 
with an amino acid from the corresponding position of a different HIV virus strain. 

Preferred compounds of the invention can include sequences from HI V- 1 clade consensus sequences: 
(clade B consensus) Wme W e r EI d n Y T? 1 1 y t L I e e s Q n Q Qe kN e q e L L e L dk W a s L w n 

Wf (SEQ ID NO: 109); 

30 (cladeA consensus)WL q WdKE I sn YT? II Y n L I E e S q n QQ E kN E q d L L A L D K W anL 
wnWF(SEQIDNO: 110); 

(clade C consensus) W M q W D R E I S N Y T d 1 1 Y r L L E D S Q N Q Q E r N E K D L L A L D S W k 
NLWNWF (SEQ ID NO: 111); 

(clade D consensus) W m e WErEI dNYTGHYsLIEeSQIQQ EKNEk eLLeLDKWAS 
35 LWNWF (SEQ ID NO: 1 12); and 

(clade E consensus) WIEWeREISNYTNqlYelLTeSQnQQDRNEKDLLeLDKWA 
S L W n W f (SEQ ID NO: 113). The amino acids in these sequences are represented by a single letter code, 
wherein a lower case letter is the represented amino acid or is substituted with an amino acid from that 
corresponding position in a sequence within the same clade, and wherein a ? is any amino acid from that 
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correspondingposition in a sequence from within the same clade. Most preferred are homologs or consensus 
sequences from Figures 16A-16G. The internal sequences are preferably found virus sequences in the group 
of HIV-1 clades consisting of clades A, B, C, D, E, F, G and F/B. 

While the locking moiety can be any structure that constrains the internal sequence to a helical 
5 peptide form and does not interfere with the a-d face (active face) of the constrained peptide, the preferred 
compounds use the locking chemistry taught herein. Compounds of the invention can have the first and 
second terminal residues with a side chain containing an amide bond-forming substituent that are linked to 
each other forming an amide bond to form a constrained helical peptide. The side chain amide bond-forming 
substituent of the first terminal residue and the side chain amide bond-forming substituent of the second 
10 tenninal residue are independently selected from the group consisting of an amino substituent and a carboxy 
substituent. The side chain amide bond-forming substituent of the first tenninal residue is a carboxy 
substituent, the side chain amide bond-forming substituent of the second tenninal residue is a carboxy 
substituent.and the difimctiona! linker is a diamine wherein the first and second functional groups are amino 
groups. In prefened form the first tenninal residue and the second terminal residue are independently selected 
1 5 from the group consisting of Asp and Glu, more preferred the first tenninal residue and the second terminal 
residue are both Glu. The first tenninal residues can have a D-thio-lysine side chain and the second terminal 

residue a L-thio- lysine that are linked to each other forming a disulfide bonded locking moiety to form a 

constrained helical peptide. 

In another embodimentthe constrainedpeptide is a hapten that is attached to a earner macromolecule, 
20 preferably covaiently linked to the constrained helical peptide, as discussed herein. The macromolecule can 
be linked to the helical peptide at the locking moiety or at amino acids at positions f, b, or c of the constrained 
helical peptide, and can be any carrier that does not interfere with the a-d face of the constrained helical 
peptide. A preferred canier for immunogenic purposes is keyhole limpet hemocyanin, or other earners 
discussed herein. 

25 In otherembodimentsthecompoundscontainmorethan one constrainedhelical peptide. The internal 

sequences of a first and a second constrained helical peptide in these embodiments are preferably different. 
The internal sequences of the first and second constrained helical peptides are from the same HIV gp41 
sequence or the same HIV clade consensus sequence, or amino acid substituted variant thereof. Theintemal 
sequences of the first and second constrained helical peptides are chosen from those that were separated by 

30 at least two helical turns (or six residues) in the HIV gp41 sequence or the same HIV clade consensus 
sequence, or amino acid substituted variant thereof. The compounds can further comprise a third constrained 
helical peptide. Again, the internal sequences of the first, second, and third constrained helical peptidesare 
preferably different In one example, the three sequences are present as separate constrained helical segments 
in a super helix of the polypeptide backbone of a 633-678 sequence as depicted in Figure 18. 

35 In other embodiments the compounds of the invention contain 1 to 38, 1 to 35, or more preferably, 

I to 19 amino acids flanking either or both terminal residues of the helical peptide. The flanking amino acids 
preferably are the flanking amino acids for the interna! sequence as found in a sequence from an HIV gp4 1 
sequence. 
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In yet other embodiments, the compounds further comprising a blocking group attached at either or 
both of the terminal residues of the helical peptide to prevent proteolytic degradation. The blocking group can 
contain a D-amino acid or a non-amide bond between adjacent flanking amino acids. 

Particularly preferred compounds include those in which a single lock is placed within sequence 
YTSLIHSLIEESQNQQEKNEQELLELD (SEQ ID NO: 2) sequence or a homolog sequence thereof, within 
EWDREINNYTSLIHSLIEESQNQQEKNEQE {SEQ ID NO: 107) sequence or a homo log sequence thereof, 
within YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNF (SEQ ID NO: 108) sequence or a homolog 
sequence thereof, to yield a constrained helical peptide. More than one constraint, preferably two, can be 
placed in these sequences, with examples shown in Figure 18. Shown in Figure 1 8 are locations of gabcde 
form helical sections when one, two or three r to H-7 locks are present in a 633-678 sequence or variant 
(truncated or sequence variant) thereof.. The two-lock variants (II), (ill), and HIV3 1, and the one-lock 
variants HIV24, (IX) and (XI) (Figure 1 8) are preferred compounds demonstrating preferred locking positions. 
Less preferredare the three^Iock variant, the two-lock (VI) and (VII) variants, and the one lock VIII and XII 
variants. Particularly preferred are the truncated variants HI V24 and HIV3 1 and their homologs from other 
i 5 HIV strains or consensus sequences or substitution variants thereof . Much less preferred are i to i+4 lock to 
constrain a "floppy" helical segment. 

In a preferred embodiment there are at least two constrained helical peptides in the compound, for 
example attached as different and independent haptens to KHL or a synthetic TASP or lysine network, or as 
two or more locked helical segments within a longer polypeptide sequence, preferably one that has a tendency 
20 to form an extended or super helical structure. The internal sequences of the first and second constrained 
helical peptidesare preferably different, for example as multiple haptens on a single hapten carrier or two or 
more locked helical segments within a longer super helix polypeptide sequence. In the latter case, the internal 
sequences of the first and second constrained helical peptides are preferably from the same HIV gp41 
sequence, the same HIV clade consensus sequence or the same amino acid substituted variant thereof The 
25 two helical peptides are attached to each other by a separating amino acid sequence, which can comprises 
from 5 to 7, 12 to 14, or 19 to 21 non helix-breaking natural or unnatural amino acids, and where preferably, 
the internal sequences of the first and second constrained helical peptides are from the same HIV gp4i 
sequence, the same HIV clade consensus sequence, or the same amino acid substituted variant thereof. The 
separating sequence can be a contiguous amino acid sequence selected from an intervening sequence that is 
30 located between the two internal sequences present in the same HIV gp41 sequence, the same HIV clade 
consensus sequence, or the same amino acid substituted variant thereof, and that excludes the two amino acids 
that correspond to the helical peptide locking positions immediately flanking the intervening sequence. An 
example is HIV3 1, in which the two constrained segments (internal amino acid sequences) are separated in 
the parent sequence (HIV35) by an eight amino acid sequence of which the amino acids at adjacent f positions 
35 used in locking are not considered part of the intervening sequence, such that the intervening sequence is a 
six amino acid sequence which is synthesized into the final peptide as a six amino acid separating sequence. 
The separating sequence is most preferably 6, 13, or 20 amino acids, in order to maintain alignment of a-d 
faces in between to constrained helical peptides. 
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The amino acids in the separating sequence retain abcdefg assignment positions of the intervening 
sequence, wherein preferably the amino acids in positions a and d in the separating sequence are identical to 
their correspondingintervening sequence amino acids. In addition, in preferred embodimentsthe amino acids 
in the separating sequence positions g and e also are identical to their corresponding intervening sequence 
5 amino acids. Less preferably, an amino acid at any one of positions a, d, g, or e is conservatively substituted 
in the separating sequence (with a sequence other than mat represented in the clade at that position). Most 
preferably, the amino acids in the separating sequence retain abcdefg assignment positions of the intervening 
sequence and an amino acid at any one of positionsa, d, g, or e is substituted in the separating sequence with 
a correspondingamino acid from its homolog sequence from another HIV strain, from a consensus sequence 

10 of its homolo g sequences from any one HIV clade, or from an amino acid substituted variant thereof. The 
amino acids in the separating sequence positions b, c, or f can be any non-helix-breakingamino acid, with the 
prefej-ences given in Figures 22 and 23A and B. Chimeras can be formed where an amino acid at any one of 
positionsa, d, g, or e of the internal sequence of six amino acids is substituted in the helical peptide with an 
amino acid from the correspondingposition of a differentHIV virus strain. Likewise substitutions of the same 

1 5 nature can be made in flanking or in separating sequences. Preferred are compounds wherein the internal 
amino acid sequence is from any one of the peptide sequences from Figure 23A and 23B. More preferably, 
the compound of the invention is selected from the group consisting of constrained helical peptides of each 
possiblesequence having any one or any combination of amino acid substitutions indicated in the constrained 
helical peptide series I to XII as shown in Figures 23A and 23B. In other embodiments, the compound is 

20 selected from the group consisting of constrained helical peptides of each possible sequence having any one 
or any combination of amino acid truncations indicated in the constrained helical peptide series I to XII as 
shown in Figures 23A and 23B. In yet other embodiments, the compound is selected from the group 
consisting of constrained helical peptides of each possible sequence having any one or any combination of 
amino acid substitutions indicated in the constrained helical peptide series 1 to XII as shown in Figures 23 A 

25 and i23B in combination with any one or any combination of amino acid truncations indicated in the 
constrainedheiical peptide series 1 to XII as shown in Figures 23A and23B. X in these sequences can be any 

A - 

non helix-breaking amino acid. 

j In yetanotherembodimentoftheinvention.peptidescomprisingthesequencesdescribed herein can 

be synthesized with additional chemical groups present at their amino and/or carboxy termini, such that, for 
30 example, the stability, bioavailability, and/or inhibitory activity of the peptides is enhanced. For example, 
hydrophobic groups such as carbobenzoxy 1, dansyl, or t-butyloxycarbonylgroups, may be added to the amino 
termini. An acetyl group of a 9-fluorenylmethoxy-carbonyI group may be placed at the amino termini. A 
hydrophobic group, t-butyloxycarbonyl, or an amido group may be added to carboxy termini. Furthermore, 
the peptides of the invention can be synthesized such that their steric configuration is altered. For example, 
35 the D-isomer of one or more of the amino acid residues of the peptide can be used, rather than the usual L- 
isomer. The compounds can contain at least one bond linking adjacentamino acids that is a non-peptidebond, 
and is preferably not helix breaking. Non-peptide bonds for use in flanking sequences include an imino, ester, 
hydrazine, semicarbazide, oxime, or azo bond. Still further, at least one of the amino acid residues of the 
peptides of the invention can be substituted by one of the well known non-naturally occurring amino acid 
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residues, that is preferably not helix breaking. Most preferably the non-natural amino acid or non-amide bond 
linking adjacent amino acids, when presents present outside of the internal sequence, and is, more preferably, 
not helix breaking. Still further, at least one ofthe amino acid residues of the peptides ofthe invention can 
be substituted by one ofthe well known non-naturally occurring amino acid residues. Alterations such as these 
5 can serve to increase the stability, bioavailability, immunogenicity, and/or inhibitory action ofthe peptides 
of the invention. 

While not wishing to be limited by any one theory, the constrained helical peptides are believed to 
derive their activity by interaction ofthe a-d face ofthe helix. The potent anti-HIV activity ofthe compounds 
ofthe invention derive from the gp41 633-678 region which corresponds to a putative alpha-helix region 

10 located in the C-terminal end ofthe gp41 ectodomain,and which appears to associatewith a distal site ongp41 
whose interactive structure is influenced by the leucine zipper motif, a coiled-coil structure, referred to as DP- 
1 07. The association of these two domains may reflect a molecular linkage or "molecular clasp" intimately 
involved in the fusion process (see Figures 18 and 19). The DPI 07 region forms a core trimer complex with 
a groove that recognizes and binds the a-d face ofthe helical peptides ofthe invention. 

1 5 When synthesized as peptides both DP-107 and DP-178 are potent inhibitors of HIV infection and 

fusion, probably by virtue of their ability to form complexes with viral gp4 1 and interfere with its fusogenic 
process; e.g., duringthe structural transitionof the viral protein from the native structure to the fusogenic state, 
the DP-107 and DP-178 peptides may gain access to their respective binding sites on the vira! gp41, and exert 
a disruptive influence. 

20 Consequently.when more than one constrained helical peptide is present, as part of a super helix or extended 
helix polypeptide backbone, the positions a and d of a first constrained helical peptide are in the same plane 
as positions a and d ofthe second constrained helical peptide. In other words, the a-d face ofthe two helices 
are aligned in the same plane. To achieve this orientation when the helices are in a polypeptide super helix, 
the first and second constrained helical peptides are separated by either 5 to 7, 12 to 14 or 19 to 21 natural or 

25 unnatural helix-forming amino acids. Preferably, the first and second constrained helical peptides are 
separated by either 6, 13, or 20 natural or unnatural helix-forming amino acids. A most preferred spatial 
alignment of the first.secbnd, and any additional constrainedhelical peptides is that found in DP107, wherein 
the a-d faces are aligned in the same plane to allow interaction with the grove in the core trimer. 

When the particularly preferred tethering chemistry as taught herein is used, the compounds ofthe 

30 invention are selected from the group consisting of: 
the compound represented by Formula (1): 

0=C-(NH)-(CH 2 ) n -(NH)-C=0 



35 



(CH 2 ) m (CH 2 ) p 

I I 
X-(NH)-(CH)-C Z— -(NH)-(CH>C-Y-S , 



O o 
(1) 

40 wherein S is absent or is a macromolecuie, 



-80- 



BNSDOCID: <WO 982003SA1_J_> 



WO 98/20036 PCT/US 97/20069 

X is hydrogen or is any amino acid or amino acid sequence, 

Y is absent, or is hydroxy! if S is absent, or is any amino acid or amino acid sequence, 
Z is an amino acid sequence consisting of six amino acids, wherein the internal sequence of six amino 
acids has the form gabcde, defgab, or cdefga and is selected from the group of sequences consisting of a 
5 sequence of six contiguous amino acids in HIV-1LAI strain gp41 amino acid sequence 633 to 678, in its 
homolog sequence from another HIV strain, in a consensus sequence of its homolog sequences from any one 
HIV clade, or amino acid substituted variant thereof, in which amino acid 633 or its corresponding amino acid 
in the homolog, consensus or variant sequence is assigned position a of a repeating abcdefg assignment; 

m and p are independently selected from the integers 0 to 6 inclusive, provided that xn+p is less than 
10 or equal to 6, and 

n is any integer in the range defined by (7-(m+p)) to (9-(m+p» inclusive, provided that n is greater 

than 1; 

the compound represented by Formula (6): 
O 



15 



H-N-C-(CH 2 ) r -(NH>C=0 
(CH 2 ) q - (GH 2 ) S 

I i 

20 X-(NH)-(CH)-C-Z—(NH>(CH>C-Y-S , 

II II 

o o 

(6) . 
wherein S is absent or is a macromolecule, X is hydrogen or is any amino acid or amino acid 

25 sequence, Y is absent, or is hydroxy! if S is absent, or is any amino acid or amino acid sequence, Z 

is an amino acid sequence consisting of six amino acids, wherein the internal sequence of six amino 
acids has the form gabcde, defgab, or cdefga and is selectedfrorn the group of sequences consisting 
of a sequence of six contiguous amino acids in HIV-ILAI strain gp41. amino acid sequence 633 to 
678, in its homolog sequence from another HIV strain, in a consensus sequence of its homolog 

30 sequences from any one HIV clade, or amino acid substituted variant thereof, in which amino acid 

633 or its corresponding amino acid in the homolog, consensus or variant sequence is assigned 
position a of a repeating abcdefg; q is selected from the integers I to 7 inclusive, s is selected from 
the integers 0 to 6 inclusive, provided that q+s . is less than or equal to 7, and r is any integer in the 
range defined by (7-{q+s)) to <9-(q+s)) inclusive, provided that r is greater than 0; 

35 the compound represented by Formula (11): 
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O 

II • 

0=C-(NH)-(CH 2 ) u -C-N-H 

I " I 

5 (CH 2 ) t (CH 2 ) V 

I I 
X-(NH)-(CH)-C-Z-(NH)-(CH)-C-Y-S ,. 

II II 

o o 

10 (11) 

wherein S is absent or is a macromolecuie, X is hydrogen or is any amino acid or amino acid 
sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, Z 
is an amino acid sequence consisting of six amino acids, wherein the internal sequence of six amino 
acids has the form gabcde, defgab, or cdefga and is selected from the group of sequences consisting 

15 of a sequence of six contiguous amino acids in HIV-1LA1 strain gp41 amino acid sequence 633 to 

678, in its homolog sequence from another HIV strain, in a consensus sequence of its homolog 
sequences from any one HIV clade, or amino acid substituted variant thereof, in which amino acid 
633 or its corresponding amino acid in the homolog, consensus or variant sequence is assigned 
position a of a repeating abcdefg assignment; t is selected from the integers 0 to 6 inclusive, and v 

20 is selected from the integers 1 to 7 inclusive, provided that tf-v is less than or equal to 7; and u is any 

integer in the range defined by (7-(t+v)) to (9-(t+v)) inclusive, provided that u is greater than 0; and 

the compound represented by Formula (16): 
O O 

II II 

25 H-N-C-(CH 2 ) X -C-N-H -- 

(CH 2 ) W (CH 2 ) y 

I I . ... . 

X-(NH)-C-C-«Z— -N-(CH)-C-Y-S , 

30 | || | || 

HO H O 

(16) • M ■ ■„ 

wherein S is absent or is a macromolecuie, X is hydrogen or is any amino acid or amino acid 
sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, Z 

35 is an amino acid sequence consisting of six amino acids, wherein the interna] sequence of six amino 

acids has the form gabcde, defgab, or cdefga and is selected from the group of sequences consisting 
of a sequence of six contiguous amino acids in HIV- 1 LAI strain gp41 amino acid sequence 633 to 
678, in its homolog sequence from another HIV strain, in a consensus sequence of its homolog 
sequences from any one HIV clade, or amino acid substituted variant thereof, in which amino acid 

40 633 or its corresponding amino acid in the homolog, consensus or variant sequence is assigned 

position a of a repeating abcdefg assignment; w and y are independently selected from the integers 
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I to 7 inclusive, provided that w+y is less than or equal to 8, and x is any integer in the range defined 
by (7-0w+y)) to (9-(w+y)) inclusive, provided that x is greater than or equal to 0. 
These compounds can further contain S' when S is absent and X is any amino acid or amino acid sequence, 
wherein S' is a macromolecule attached to-X. The X or Y can contain a blocking group that prevents 
5 enzymatic degradation. Standard terminal blacking groups as known in the art are suitable. X or Y can also 
contain a D-amino acid or a non-amide bond between adjacentamino acids to prevent enzymatic degradation. 

The compoundscan be formulated with a carrier as taught herein. When the helical peptide is to be 
used as a hapten the carrier can be an adjuvant. Typically, compositions of the invention are sterile. 
Compositions can contain at least two compounds of the invention, ether free or covalently or ionically 
10 attachedto one another. The peptides of the invention that have a virus fusion inhibitor activity, can be used 
in combination with other therapeutic agents, preferably in combination with another antiviral agent, to 
enhance its antiviral effect. Such antiviral agents include but are not limited to those which function on a 
different target molecule involved in viral replication, e.g., reverse transcriptase inhibitors, viral protease 
inhibitors, glycosylation inhibitors; those which act on a different target molecule involved in viral 
1 5 transmission;those which act on a different loci of the same molecule; and those which prevent or reduce the 
occurrence of viral resistance. 

In treating mammals, including humans, having a viral infection, a therapeutically effective amount 
of the compounds of the invention, or a pharmaceutically acceptable derivative, is administered is a dose 
sufficient to inhibit viral replication, either alone or in combination with other virus inhibiting drugs. For 
20 example H1V3 1 or HIV 24 can be administered as an infusion at about 0. 1 mg/kg to 1 .0 mg/kg per day for 
about 12 weeks. A preferable dose is from 20 mg to 35 mg. Doses can be administered in intervals of from 
about once per day to 4 times per day and preferably from about once every two days to once per day. A 
preferred dose is administered to achieve peak plasma concentrations of compound of from about 1 mg/mi 
to 10 mg/ml. This may be achieved by the sterile injection of about a 2.0% solution of the administered 
25 ingredients in buffered saline (any suitable saline solutions known to those skilled in the art of chemistry may 
be used). Desirable blood levels may be maintained by a continuous infusion as ascertained by plasma levels 
measured by HPLC. Pharmaceutical compositions containing me compounds of the invention can be 
administered*) a human patient, by itself or in pharmaceutical compositions where it is mixed with suitable 
carriers or excipient(s),as taught herein, at doses to treat a viral infection, in particular HIV infection. Suitable 
30 routes of administration include oral, rectal, transmucosal, or intestinal administration; parenteral delivery, 

includmgintramuscular,subcutaneous,intramedullaryinjections,as well as intrathecal, direct intraventricular, 

. intravenous, intraperitoneal, intranasal, or intraocular injections; transdermal, topical, vaginal and the like. 
Dosage forms include but are not limited to tablets, troches, dispersions, suspensions, suppositories, solutions, 
capsules, creams, patches, minipumps and the like. 
35 As discussed herein the compounds of the invention are particularly suited as haptens to raise an 

antibody that binds to the compound, preferably the antibody specifically binds an epitope comprising an 
amino acid at position a, d, e, or g in the helical peptide. Preferred antibodies are monoclonal. Antibodies 
of the invention,not only recognize the peptides of the invention, but preferably recognize the corresponding 
sequence when present in the virus. They may also bind unconstrainedDP17S. More preferably, the antibody 
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neutralizes HIV viral infectivity and/or neutralizes HIV virus membrane fusion. Thus the antibodies can 
recognize and bind gp41 sequence. 

In another embodiment is provided a method to immunize an animal, comprising administering to 
the animal an immunogenic amount of a compound of the invention, 
5 In yet another embodiment is provided a method to prophylactically or therapeutically treat a 

mammal at risk for or infected with HIV, comprising administering a composition comprising a 
prophylacticall y or therapeutically effective amount of a compound of the invention and a carrier While 
antibodies of the invention are expected to have broad viral activity, preferably, the composition comprises 
internal six amino acid sequences from different HIV strains or HIV clades. The compositions include a 

1 0 vaccine formulation. The formulations can contain one or more (multivalent) constrained helical peptides 
form different HIV strains, for use as a vaccine or immunogen. The composition can be administered, 
prophylactically or therapeutically, to a patient at risk of infection or in need of such treatment using the 
dosages and routes and means of administration that are readily determined. However, chronic administration 
may be preferred and dosages can be adjusted accordingly. 

15 Administration of the compounds containing the constrained helical peptides of the invention as a 

prophylactic vaccine (or therapeutic vaccine), can comprise administering to a host a concentration of peptides 
effective in raising an immune response which is sufficient to neutralize HIV, by, for example, inhibiting HIV 
ability to infect cells. The exact concentration will depend upon the specific peptide to be administered, but 
may be determined by using standard techniques for assaying the development of an immune response which 

20 are well known to those of ordinary skill in the art. The peptides to be used as vaccines are usually 
administered intramuscularly. The peptides may be formulated with a suitable adjuvant in order to enhance 
the immunological response. Such adjuvants may include, but are not limited to, mineral gels such as 
aluminum hydroxide; surface active substances such as lysolecithin, pluronic polyols, polyanions; other 
peptides; oil emulsions; and potentially useful human adjuvants such as BCG and Corynebacterium parvum. 

25 Many methods may be used to introduce the vaccine formulations described here. These methods include but 
are not limited to oral, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, and intranasal 
routes. 

A compound of this invention in a suitable carrier or excipient is used to make a vaccine. The 
polypeptide can be used alone, but is preferably administered in a multivalent subunit vaccine that includes 

30 internal sequences from MN strain. The vaccine usually includes constrained helices representing 3 to about 
5 different strains, but 30 or more different gp41 -based constrained helical polypeptides can be used to provide 
a more effective vaccine. Of particular interest are gp4 1 sequences from breakthrough isolates of HIV vaccine 
trials. Use of a a homolog gp4 1 sequence from one or more of breakthrough isolates in a subunit vaccine, 
usually together a sequence from a commonly present isolate like the MN sequence, can provide protection 

35 against HIV strains that are sufficiently different from the common strain (e.g., MN) that the typical single 
subunit vaccine does not confer protection against those strains. 

Preparation of polypeptides for use in a vaccine is well known. The compound with the desired 
degree of purity and at a sufficient concentration to induce antibody formation is mixed with a physiologically 
acceptable carrier. A physiologically acceptable carrier is nontoxic to a recipient at the dosage and 
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concentration employed in the vaccine. Generally, the vaccine is formulated for injection, usually 
intramuscular or subcutaneous injection. Suitable carriers for injection include sterile water, but preferably 
are physiologic salt solutions, such as normal saline or buffered salt solutions such as phosphate-buffered 
saline or ringer's lactate. The vaccine generally contains an adjuvant. Useful adjuvants include QS21 (Quillaja 
5 saponaria, commercially available from Cambridge Biotech, Worcester, MA), which stimulates cytotoxic 
T-cells, and alum (aluminum hydroxide adjuvant). Formulations with different adjuvants which enhance 
cellular or local immunity, can also be used. 

Additional excipients that can be present in the vaccine include low molecular weight polypeptides 
(less than about 10 residues), proteins, amino acids, carbohydrates including glucose or dextrans, chelating 
10 agents such as EDTA, and other excipients that stabilize the protein or inhibit growth of microorganisms. 

The vaccine can also contain other HIV proteins. In particular, gp 120, or the extracellular portion 
of gp4I or HIV-1 core proteins such as P24, P17, and P55 can be present in the vaccine. Preferably, any 
gpI20 present in the vaccine is from an HIV isolate sequence represented in a constrained helical peptide 
present in the vaccine. 

1 5 Vaccine formulations generally include a total of about 10 to 5,000 ug of compound, more preferably 

about 100 to 1000 ug, even more preferably about 300 to 600 jig , conveniently in about 1.0 ml to 1.5 ml of 
carrrer. The amount of compound representing any one isolate or clade present in the vaccine will vary 
depending on the immunogenicity of the compound, For example, a constrained helical peptide with 
sequences from some strains of HIV may be less immunogenic than those from the MN strain. If peptides 

20 representingtwo strains having different immunogenicity are used in combination, empirical titration of the 
amount of each virus would be performed to determine the percent of the peptide of each strain in the vaccine. 
For isolates having similar immunogenicity, approximately equal amounts of each isolates peptide would be 
present in the vaccine. Methods of determining the relative amount of an immunogenic protein in multivalent 
vaccines are well known and have been used, for example, to determine relative proportions of various isolates 

25 in multivalent polio vaccines. 

The vaccinesare generally administered at 0, 1, and at 6, 8 or 12 months, depending on the protocol. 
A preferred protocol includes administration at 0, 1, 6, and 12 months. Following the immunization 
procedure, annual or bi-annual boosts can be administered. However, during the immunization process and 
thereafter, neutralizing antibody levels can be assayed and the protocol adjusted accordingly. 

30 The vaccine is administered to uninfected individuals. In addition, the vaccine can be administered 

to seropositive individuals to augment immune response to the virus. 

Although the compounds described herein can be used as a vaccine as described above, the 
compounds can also be used alone or in combinations in the same type of formulation, for use as an 
irnmunogen, to induce antibodies that recognize the isolate(s) present in the immunogen. Immunogens are 

35 formuiatedin the same manner as vaccines and can include the same excipients, etc. Antibodies induced by 
the immunogens can be used in a diagnostic to detect the HIV strain in patient sera or body fluid samples, or 
to affinity purify the particular gp41 molecule or virus. The compounds also find use in diagnostic assays to 
detect the presence of antibodies in HIV in sera from individuals suspected of being infected 
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In a further embodiment, the locked belix peptides of the invention are used to create constrained 
combinatorial peptide libraries. Combinatorial peptide libraries are uniquely suited to incorporate constrained 
peptides. The libraries are constructed with a "split synthesis" method in which a solid support (e.g. beads) 
is aliquoted equally and a different amino acfd is coupled separately to each portion. The portions are pooled, 
5 resplit and the process is repeated. In the "pep tides-on-beads ,f technique, this process yields a mixture of 
beads, each of which is coupled to a peptide of unique sequence. The bead mixture can be used directly in 
a binding selection, with binding detected colorimetrically and positive beads physically removed from the 
mixture for microsequencing (Ciackson and Wells, Tibtech . J2: 173-184 (1994)). To produce a library of 
peptides containing a random sequence of six (or more) amino acids locked into a helical conformation by / 

10 and 7+7 residues according to the invention, the split synthesis technique is modified to place / and 7+7 
residues in set positions separated by six residues in each random amino acid sequence, and the peptides are 
cyclized by linking the side chain amide bond-forming substituents of the J and 7+7 residues in each peptide 
using any of the methods described in Section II below. 

Combinatorial libraries containing the constrained peptides of the invention are a particularly 

1 5 powerful tool for identification of high affinity ligands in drug design. Given the prevalence of the a-helical 
motif in acti ve sites of binding proteins, including DNA binding proteins, and the absence of amino acid 
sequence constraints in the invention's tethering system, the locked helix peptides of the invention greatly 
increase the utility of combinatorial peptide libraries in screening methods for specific binding activities, such 
as the methods of U.S. Pat, No. 5,306,619 used to screen for DNA sequence-specific binding molecules. 

20 IL Methods for Constructing Synthetic Locked Helix Peptides 

According to the present method, an element of ot-heiical structure is removed from its context in a 
native protein by constructing a peptide with an amino acid sequence spanning the a-helical secondary 
structure of interest in the native protein, and constrainingthe short peptide into an c^helscal conformation that 
mimics the a-helical secondary structure of interest. The present methods enable the practitioner to lock into 

25 a helical conformation any peptide that is six amino acids in length by placing an amino acid with a side chain 
amide bond-forming substitutentat the N-terminusof the peptide and placing another amino acid with a side 
chain amide bond-forming substitutentat the C-terminus of the peptide, and then joining the side chain amide 
bond-forming substituents of the N-tenninal and C-terminai residues to form a cyclized structure which 
mimics the conformation of an a-helix. The present methods also enable the practitionerto lock into a helical 

30 conformation any sequence of six amino acid residues in a larger peptide by importing two residues with side 
chain amide bond-forming substituents into the N-terminal amino acid position and the C-terminal position 
amino acid position flanking the sequence (of six amino acid residues) of interest within a larger peptide, and 
then joining the side chain amide bond-forming substituents of the N-terminal and C-terrninal flanking 
residues to form a cyclized structure which mimics the conformation of an a-helix. 

35 There are at least two general methods for constructing the constrained helix peptides of the 

invention: (1) synthesis of a linearpeptide comprising* pair of residues that flank an amino acid sequence that 
is six residues in length, wherein the two flanking residues are independently selected from the group 
consisting of amino acid residues with side chain amide bond-forming substituents, followed by bridging the 
side chain amide bond-forming substituents of the flanking residues with a difunctional linker to cyclize the 
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peptide: and (2) synthesis of a linear peptide comprising a pair of residues that flank an amino acid sequence 
that is six residues in length, wherein the two flanking residues are independently selected from the group 
consisting of amino acid residues with side chain amide bond-forming substituents, and wherein one of the 
flanking residues is added to the peptide chain carrying a difunctionai linker such that one functional group 
5 of the linker is coupled to the residue's side chain amide bond-forming substitutent, fol lowed by coupling of 
the linker's free functional group to the side chain amide bond-forming substitutent on the other flanking 
residue to cyclize the peptide. 

Any amino acid that has a side chain containing a substitutent capable of forming an amide bond can 
be used as a flanking residue herein. Suitable flanking amino acid residues include amino acids with side 
1 0 chains carrying a free carboxy group, such as arninopropanedioic acid, Asp, Glu T 2-aminohexanedioic acid, 
and 2-aminoheptanedioic acid, and amino acids with side chains carrying a free amino group, such as 2,3- 
diaminopropanoicacid (2,3-diaminopropionicacid), 2,4-diaminobutanoicacid (2,4-diaminobutyricacid), 2,5- 
diaminopentanoic acid, and Lys. 

(1 ) Synthesis of Linear Peptide without Difunctionai Linker-Coupled Flanking Amino Acid 
15 a. Peptide Synthesis 

The desired peptide sequence is designed such that the sequence of six amino acid residues to be 
helicized extends between two flanking residues independently selected from the group consisting of amino 
acid residues with side chain amide bond-forming substituents. In one embodiment, the side chain amide 
bond-formingsubstituentsof the N-terminaJ and C-terminal flanking residues are independently selected from 
20 the group consisting of a carboxy substitutent and an amino substitutent In another embodiment, the side 
chain amide bond- forming substituents of the N-terminal and C-terminal flanking residues are both carboxy 
substituents. In yet another embodiment, the side chain amide bond-forming substitutent of one of the 
flanking residues is a carboxy substitutent and the side chain amide bond-forming substitutent of the other 
flanking residue is an amino substitutent. In still another embodiment, the side chain amide bond-forming 
25 , substituents of the flanking residues are both amino substituents. In yet another, embodiment, the flanking 
:-- residues are independently selected from the group consisting of arninopropanedioic acid, Asp, Giu, 2- 
aminohexanedioicacid, 2-aminoheptanedioic acid, 2-aminooctanedioic acid, 2-aminononanedioic acid, 2^ 
diaminopropanoic acid,2,4-diamm^ 
2,8-diaminooctanoic acid, and 2,9-diaminononanoic acid. 
30 m some embodiments,me desired peptide contam^ 

from the C-terminal flanking residue and/or N-terminai flanking residue. 

Once the desired peptide sequence is selected, chemical synthesis can be employed to construct the 
constrained helix peptide of the invention. This can be accomplished by modifying any one of a number of 
methodologieswell known in the art (see Kelley, R.F. & Winkler, M.E. in Genetip Engineering Principles and 
35 Methods . Setlow, J.K, ed., Plenum Press, RY., vol. 12, pp 1-1 9 (1990), Stewart, J.M Young, J.D., Solid Phase 
Peptide Synthesis . Pierce Chemical Co., Rockford, IL (1984); see also U.S. Pat. Nos. 4,105,603; 
3,972,859;3,S42,067; and 3,862,925) to produce a the desired peptide. 

Peptides of the invention can be conveniently prepared using solid phase peptide synthesis 
(Merrifield, J. Am. Chem. Soc. 85: 2149 (1964); Houghten, Proc. Natl. Acad. ScL USA, 82: 5132 (1985). 
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Solid phase synthesis begins at the carboxy terminus of the putative peptide by coupling a protected amino 
acid to an inert solid support. The inert solid support can be any macromolecule capable of serving as an 
anchor for the C-terminusof the initial amino acid. Typically, the macromolecuiar support is a cross-linked 
polymeric resin (e.g. a poiyamideor polystyrene resin) as shown in Figures 1-1 and 1*2, on pages 2 and 4 of 
5 Stewart and Young, supra. In one embodiment, the C-terminal amino acid is coupled to a polystyrene resin 
to form a benzyl ester. A macromolecuiar support is selected such that the peptide anchor link is stable under 
the conditions used to deprotect the a-aminp group of the blocked amino acids in peptide synthesis. If an 
base-labiie a-protecting group is used, then it is desirable to use an acid-labile link between the peptide and 
the solid support. For example, an acid-labile ether resin is effective for base-labile Fmoc-amino acid peptide 
1 0 synthesis as described on page 1 6 of Stewart and Young, supra. Alternatively, a peptide anchor link and a- 
protecting group that are differentially labile to acidolysis can be used. For example, an aminomethyl resin 
such as the phenylacetamidomethyl (Pam) resin works well in conjunction with Boc-amino agid peptide 
synthesis as described on pages 1 M2 of Stewart and Young, supra. 

After the initial amino acid is coupled to an inert solid support, the a-amino protecting group of the 
15 initial amino acid is removed with, for example, trifluoroacetic acid (TFA) in methylene chloride and 
neutralizing in, for example, triethylamine (TEA), Following deprotection of the initial amino acid's a-amino 
group, the next a-amino and side chain protected amino acid in the synthesis is added. The remaining a-amino 
protected and, if necessary, side chain protected amino acids are then coupled sequentially in the desired order 
by condensation to obtain an intermediate compound connected to the solid support. Alternatively, some 
20 amino acids may be coupled to one another to form a fragment of the desired peptide followed by addition 
of the peptide fragment to the growing solid phase peptide chain. 

The condensation reaction between two amino acids, or an amino acid and a peptide, or a peptide and 
a peptide can be carried out according to the usual condensation methods such as the axide method, m ixed acid 
anhydride method, DCC (N,N'-dicyclohexylcarbodiimide) or DIC (N^'-diisopropylcarbodiirnide) methods, 
25 active ester method, p-nitrophenyl ester method, BOP (benzotriazole-l-yl-oxy-tris [dimethylamino] 
phosphoniumhexafluorophosphate)method, N-hydroxysuccinicacid imido ester method, etc, and Woodward 
reagent K method. 

It is common in the chemical syntheses of peptides to protect any reactive side-chain groups of the 
amino acids with suitable protecting groups. Ultimately, these protecting groups are removed after the desired 

30 polypeptide chain has been sequentially assembled. Also common is the protection of the a-amino group on 
an amino acid or a fragment while that entity reacts at the carboxy group followed by the selective removal 
of the a-amino protecting group to allow subsequent reaction to take place at that location. Accordingly, it 
is common in polypeptide synthesis that an intermediate compound is produced which contains each of the 
amino acid residues located in the desired sequence in the peptide chain with various of these residues having 

3 5 side chain protecting groups attached. These protecting groups are then commonly removed at substantially 
the same time so as to produce the desired product following cleavage from the resin. Protecting groups and 
procedures for their use in peptide synthesis are reviewed in Protective Groups in Organic Synthesis , 2d ed. s 
Greene, T.W, and Wuts, P.G,M„ Wiley & Sons (New York: 1 991). 
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Suitable protecting groups for a-amino and side chain amino groups are exemplified by 
benzyloxycarbony](abbreviated Z), isonicotinyloxycarbonyl (uNOC), o-chiorobenzyloxycarbonyl [Z(2C1)], 
p-nitrobenzyloxycarbonyl [Z(tt0 2 )], p-methoxybenzyloxycarbonyl [Z(OMe)], t-butoxycarbonyl (Boc), t- 
amyioxycarbonyl(Aoc), isoboroyloxycarbonyladamaniyloxyc^ 
5 (Bpoc), 9-fluorenylmethoxycarbonyKFmoc), methy Isulfonyethoxycarbonyl (Msc), trifluoroacetyl, phthalyl, 
formyl, 2-nitrophenyisulfenyl (NPS), diphenylphosphinothioyl (Ppt), and dimetbylphosphinotbioyl (Mpt) 
groups, and the like. 

1 Protective groups for the carboxy functional group are exempHfiedby benzyl ester, (Obz), cyclohexyl 
ester (Chx), 4~nitrobenzyl ester (Onb), t-butylester (Obut), 4-pyridyimethyl ester (Opic), and the like. It is 
1 0 often desirable that amino acids such as argin ine, cysteine, and serine possessing a functional group other than 
amino and carboxy groups be protected by a suitable protecting group. For example, the guanidino group of 
argintne may be protected iwith nitro, p-toluenesulfonyl, benzyloxycarbonyi, adamantyloxycarbonyl, p- 
methoxybenzenesulfonyl, 4-methoxy-2, 6-dimethylbenzenesulfonyl (Nds), U3,5-trimethylphenysulfonyl 
(Mts), and the like. The thiol group of cysteine can be protected with p-methoxybenzyl, trityl, and the like. 
15 in one embodiment, the peptides of the invention are synthesized with the help of blocking groups 

thatprotectthe side chain amide bond-form ingsubstituentsof the N-terminaland C-tenninal flanking residues. 
The protecting group or groups used for the side chain amide bond-fonningsubstituents of the N-terminal and 
C-terminai flanking residues can be the same or different than the protecting group or groups used to block 
the side chain functional groups of other residues in the peptide. In a preferred embodiment, the protecting 
20 group or groups used to block the side chain amide bond-formingsubstituents is (are) differentially removable 
with respect to the protecting groups used for other side chain functional groups in the peptide, i.e. the side 
chain; amide bond-formingsubstituents can be deprotected without deprotecting the other side chain functional 
grouris in the peptide, in addition to being differentially removable with respect to the a-amino protecting 
group used in peptide synthesis. In another preferred embodiment, the side chain amide bond-forming 
25 subs&tuents of the flanking residues are orthogonally protected with respect to each other such that the side 
chain* amide bond-forming substituent of one flanking residue can be deprotected without deprotectingthe side 
chairi amide bond-forming substituent of the other flanking residue. 

I Suitable protecting groups for use in orthogonally protecting the side chain amide bond-forming 
substituentsof the flanking residues with respect to other functional groups and/or with respect to each other 
30 include pairs of differentially removable carboxy protective groups, such as a reduction-labile carboxy 
protective group, e.g. allyl or benzyl esters, paired with a base-Iabiie carboxy protective group, e.g. 
fluoteny Imethy Jesters, methyl or other primary alkyi esters. Fluorenylmethyl, methyl or other primary aikyl 
groups or other base-labile carboxy protective groups can be removed from their corresponding esters to yield 
a free carboxy group (without deprotecting allyl or benzyl esters or other reduction-labile esters) by 
35 saponificationofthe esters with a suitable base such as piperidine and sodium hydroxide in a suitable solvent 
such as dimethylacetamide, or methanol and water, for a period of 10 to 120 minutes, and preferably 20 
minutes, at 0 to 50°C, The allyl or benzyl or other reduction-labile esters can be removed when desired by 
reduction in the presence of a suitable transition metai catalyst, such as Pd(PPh 3 ) 4 , PdCPPl^C^, Pd(OAc) 2 
or Pd on carbon with a source of hydrogen, e.g. H 2 gas, in a suitable solvent such as dimethylacetamide, 
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dimethylformamide, N-methylpyrrolidinoneor methanol for a period of 10 to 500 minutes, and preferably 100 
minutes, at 0 to 50°C. For the sake of simplicity and convenience, all carboxy protective groups that are 
removable by Pd-catalyzed methods which result in the reduction of the protected carboxy substitutent are 
included in the term "reduction-labile protective groups" as used herein, even though such Pd-cataiyzed 
5 deprotection methods may not result in the reduction of the protective group in question. 

In embodiments wherein Pd catalysis involves the formation of intermediates of Pd derivatized with 
reduction-labile protecting groups, e.g. Pd-allyl derivatives, the Pd catalyst can be restored by reaction with 
a suitable nucleophile, such as piperidineor tributyltin hydride. When such reduction-labile groups are used 
to provide orthogonal protection in combination with base- labile protecting groups, it is preferable to either 

1 0 ( 1 ) util ize a synthetic scheme that calls for the removal of the base- labile protecting groups before the removal 
of the reduction-labile protecting groups or (2) restore the Pd catalyst with a nucleophile that does not 
deprotect the base-labile protecting groups. 

Alternatively, the carboxy substituents of the flanking residues can be orthogonally protected with 
respect to other functional groups and/or with respect to each other by using an acid-labile protecting group, 

15 such as a tertiary alky! ester, - e.g. t-butyl ester, in combination with a reduction-labile protecting group, such 
as the ally 1 or benzyl esters described above. The tertiary alky 1 or other acid-labile ester group can be removed 
by acido lysis, e.g. with trifluoroacetic acid in methylene chloride, and the aliyl or beiizy i or other reduction- 
labile esters can be removed by reduction in the presence of a transition metal catalyst as described above. 
In another embodiment, the carboxy substituents of the flanking residues can be orthogonally 

20 protected with respect to other functional groups and/or with respect to each other by using a fluoride ion- 
labile protecting group, such as 2-(trimethylsilyl)ethy land silyl esters, in combination with a reduction-labile 
protecting group, such as the allyl or benzyl esters described above, or in combination with a base-labile 
protecting group, such as the fluorenylmethyl, methyl or other primary alky 1 esters described above, without 
deprotectingthe reduction- labile or base-labile esters. The 2-(trimethy Isily i)ethy t.sity I or other fluoride-labile 

25 ester group can be removed by reaction with a suitable fluoride ion source, such as tetrabutyiammonium 
fluoride in the presence of a suitable solvent, such as dimethylacetamide(DMA), dtmethylformamide (DMF), 
tetrahydrofuran (THF) T or acetonitrile. 

Suitable protecting groups for use in orthogonally protecting the side chain amide bond-forming 
substituentsof the flanking residues with respect to other functional groups and/or with respect to each other 

30 also include pairs of differentially removable amino protective groups, such as an allyloxycarbony 1 or other 
... reduction-labile amino protective group paired with a t-butoxycarbonyl (Boc) or other acid-labile amino 
protective group, and a reduction-labile amino protective group paired with a fluorenylmethoxycarbonyl 
(Fmoc) or other base-labile amino protective group. An aliyloxy carbonyl (or other reduction-labile blocking 
group) protected amino group can be deprotected by reduction using a transition metal catalyst as in the 

35 procedures for removing reduction-labile carboxy protective groups described above, without deprotecting 
a Boc or Fmoc protected amino group. Likewise, an acid-labile amino protective group and a base-labile 
amino protective group can be removed by acidolysisand base saponification, respectively, without removing 
a reduction-labile amino protective group. For the sake of simplicity and convenience, all amino protective 
groups that are removable by Pd-catalyzed methods which result in the reduction of the protected amino 
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substitutentare included in the term "reduction-iabileprotective groups" as used herein, even though such Pd- 
catalyzed deprotection methods may not result in the reduction of the protective group in question. 

In anotherembodiment,the amino substituents of the flanking residues can be orthogonally protected 
with respect to other functional groups and/or With respect to each other by using a fluoride-labile protecting 
5 group, such as 2-trimethylsilylethylcarbamate(Teoc), in combination with a reduction-lab iieprotecting group, 
such as allyloxylcarbonyl, or in combination with a base-labile protecting group, such as 
fluorenylmethoxycarbonyi, as described above. The Teoc or other fluoride-labile group can be removed by 
reaction with a suitable fluoride ion source, such as tetrabutylammonium fluoride, as in the procedures for 
removal of fluoride-labile carboxy protective groups described above, without deprotecting an 
1 0 allyloxycarbonyl or fluorenylmethoxycarbonyi protected amino group. Likewise, a reduction-labile amino 
protective group and a base-labile amino protective group can be removed by reduction and base 
saponification, respectively, without removing a fluoride- labile amino protective group. 

In embodimentsthat use a carboxy substituent. as the side chain amide bond-forming substituent of 
one flanking residue and that use an amino substituent as the side chain amide bond-forming substituent of 
15 the other flanking residue, the carboxy substituent and the amino substituent can be orthogonally protected 
with respect to each other by using a reduction-labileprotecting group to block one substituent, e.g. allyl ester 
or allyloxycarbonyl.and a fluoride-labile,acid-labileor base-labileprotectinggroup to block other substituent, 
e.g. silyl ester, t-butyl ester, fluorenylmethyl ester, Teoc, Boc, or Fmoc. 

In a preferred embodiment, a reduction-labileprotecting group is used to block the side chain amide 
20 bond-forming substituent of one flanking residue and the protecting group for the side chain amide bond- 
forming substituent of the other flanking residue is selected such that it provides orthogonal protection with 
respect to both the reduction-labile protecting group and the a-amino protecting group used in peptide 
synthesis. Forexample, in an embodiment using Fmoc chemistry for peptidesynthesis, orthogonal protection 
of the side chain amide bond-forming substituents would be provided by a reduction-labile protecting group 
25 and an acid-labileprotectinggroup, ••= Likewise, in an embodiment using Boc chemistry for peptide synthesis, 
orthogonal protection of the side chain amide bond-forming substituents would be provided by a reduction- 
labile protecting group and a base-labile protecting group. 

In yet another preferred embodiment, the side chain amide bond-formingsubstituents of the flanking 
residues are orthogonally protected with respect to each other, with respect to ct-amino protecting group used 
30 in peptide synthesis, and with respect to me prote^ 

in the peptide chain. - • •• •' " - 

In stilianomerprefertedembodiment,the side ^ 
residues are orthogonally protected with respect to each other, and with respect to a-amino protecting group 
used in peptide synthesis, but only one of the side chain amide bond-forming substituents is orthogonally 
35 protectedwimrespecttotheprotectinggroupsusedtoblockother sidechain functional groups in the peptide 
chain. In mis embodiment, it is preferable to use the side chain amide bond-forming substituent with fully 
orthogonaiprotectionas the targetfor initial coupling of the peptide to the Afunctional linker. Since the side 
chain amide bond-forming substituent with fully orthogonal protection can be deprotected without 
deprotecting other functional groups, the coupling reaction will be specific to the desired side chain amide 
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bond-formmgsubstituent,andwill reduce the production of unwanted peptide/difunctional linker derivatives. 
Although cyciization will require the deprotection of the side chain amide bond-forming substituent of the 
other flanking residue, and may cause concomitant deprotection of other side chain functional groups, 
unwanted derivatives are less likely to form given that the peptide chains are anchored to a solid support and 
5 that the linker length will regioselectively favor a coupling reaction between the unbound functional group 
of the linker and the side chain amide bond-forming substituent of the other flanking residue. If further 
peptide chain synthesis is desired after cyciization, any side chain functional groups on other amino acid 
residues left unprotected by the cyciization reactions can be reprotected before chain synthesis is resumed. 

Many of the blocked amino acids described above can be obtained from commercial sources such 
10 as Novabiochem (San Diego, CA), Bachem CA (Torrence, CA) or Peninsula Labs (Belmont, CA). 

In addition, the methodsof the invention can be practiced in conjunction with solution phase peptide 
synthesis, for example, the solution phase peptide, synthesis methods described in Principles of Peptide 
Synthesis . 2d ed, M. Bodanszky, Springer-Verlag (1993) or in The Practjce of Peptide Synthesis , 2d ed, M. 
Bodanszky and A. Bodanszky, Springer-Verlag (1994). It will be appreciated that solution phase peptide 
15 synthesis methods can be easily modified to incorporate the desired flanking residues, with or without 
orthogonally-protected side chain amide bond-forming substituents, into the peptide chain of interest, using 
procedures similar to those used in the solid phase peptide synthesis methods described herein. It will be 
further appreciated that all references to peptide synthesis herein encompass both solid phase and solution (or 
liquid) phase peptide synthesis methods, unless otherwise indicated. 
20 Peptide Cyciization 

After the desired amino acid sequence has been completed, the linear peptide is cyclized in order to 
constrain the peptide in a helical conformation. Any method of bridging the side chain amide bond-forming 
substituents of the flanking residues with a difunctional linker is suitable for producing the constrained helical 
peptides of the invention. 
25 (i) Selection of Difunctional Linker 

Typically, the difunctional linker suitable for use herein is capable of presenting two functional 
groups separated by a distance of or about 5 A to or about 30 A, and preferably of or about 8 A to or about 1 4 
A, and more preferably of orabout 10 A, such that the side chain amide bond-forming substituent of one of 
the flankingresidues can form an amide linkage with one or either of the functional groups of the linker and 
30 the side chain amide bond-forming substituent of the other flanking residue can form an amide linkage with 
the remaining functional group of the linker. It will be appreciated that the nature of the molecular scaffold 
used to presentthe desired functional groups in the proper spatial relationship is not critical to the practice of 
the invention. Although straight chain and branched aikyl scaffolds are suitable for use herein, the invention 
is not so limited. For example, alkenyl, alkynyi, cycloalkyl, or other aliphatic hydrocarbon species, with or 
35 without heteroatoms s and monophenyl, biphenyl, naphthyl, and other aromatic hydrocarbon species, with or 
without heteroatoms t that are substituted with the desired functional groups in the proper spatial relationship 
(e.g. para- or meta-substitutions in ring structures such as monophenyl, biphenyl, naphthyl and the like) can 
be used to link the side chain amide bond-forming substituents of the flanking residues. 
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The functional groups used in the difiinctiona! linker are selected such that they are capable of 
forming amide linkages with the side chain amide bond-forming substituents of the flanking residues used in 
the peptide to be cyclized. In embodiments wherein the side chain amide bond-forming substituent of each 
flanking residue is a carboxy substituent, the peptide can be conveniently cyclized with a diamine linker. In 

5 one example, the flanking residues and the diamine linker are selected according to Table 1 below. It will be 
appreciated that each of the flanking residues and linker molecules listed in Table 1 below is considered to 
representnot only the particular molecule correspondingto the given chemical name under IUPAC rules, but 
also any variant of the molecule containing additional substituents or modified substituents which do not 
prevent or substantially alter the functioning of the amino and/or carboxy groups contained in the molecule, 

10 which functioning is necessary for use of the molecule in the methods of the invention. Accordingly, each 
molecule listed will be understood to encompasses variant molecules containing alkenyl, alkynyi and other 
unsaturated bonds, heteroatoms, cycloalkyl substituents, aromatic substituents, or other substituents in the 
carbon backbone of the molecule, and/or variants containing the foregoing or other substituents or groups in 
place of hydrogen atoms on the carbon backbone of the molecule. 
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10 



20 



25 



30 



Item Flanking 
No. Residue#l 



Table I 

Flanking 
Residue#2 



Diamine 
Linker 



1 aminopropanedioic acid 



2 aminopropanedioic acid 



3 aminopropanedioic acid 



15 4 aminopropanedioic acid 



5 aminopropanedioic acid 



6 aminopropanedioic acid 



7 aminopropanedioic acid 



8 aspartic acid 



9 aspartic acid 



10 aspartic acid 



35 1 1 aspartic acid 



12 aspartic acid 



40 1 3 glutamic acid 



14 glutamic acid 



45 



aminopropanedioic acid 

aspartic acid 

glutamic acid 

2-aminohexanedioic acid 

2-aminoheptanedioic acid 

2-aminooctanedioic acid 

2-aminononanedioic acid 
aspartic acid 

glutamic acid 

2-aminohexanedioic acid 

2-aminoheptanedioic acid 

2-aminooctanedioic acid 
glutamic acid 

2-aminohexanedioic acid 
-94- 



1 .7- diaminoheptane; 

1 .8- diaminooctane; 
U9-diam inononane 

1 .6- diaminohexane; 

1 .7- diaminoheptane; 

1.8- diaminooctane 

1 ,5-diaminopentane ; 
L,6-dtaminohexane; 
1 ,7-diarninoheptane 

I ,4-diaminobutane; 

1 .5- diaminopentane ; 

1 .6- diaminohexane 

1 .3- diam inopropane ; 

1 .4- diaminobutane; 

1 .5- diaminopentane 

\ ,2-diaminoethane; 

1 .3- diaminopropane; 

1 .4- diaminobutane 

1 .2- diam inoethane ; 

1 .3- diaminopropane 

i ,5-diaminopentane; 

1 .6- diaminohexane; 

1 .7- diaminoheptane 

1 .4- diaminobutane ; 

1 .5- diaminopentane; 

1 .6- diaminohexane 

1 ,3-diaminopropane; 

1 .4- diaminobutane; 

1 .5- diaminopentane 

1 .2- dtaminoethane; 

1 .3- diaminopropane; 

1 .4- diaminobutane 

1 T 2-diam inoethane; 
1 ,3-diaminopropane; 

1.3- diaminopropane; 

1 .4- diaminobutane; 

1 .5- diaminopentane 

1 .2- diaxninoethane; 

1 .3- diaminopropane; 
i ,4-diaminabutane 



BNSDOCID; <WO 9B20036A1_?_> 



WO 98/20036 



PCT/US97/20069 



10 



15 



20 



25 



15 glutamic acid 



16 2-aminohexanedioic 
acid 



40 



2-aminoheptanedioic acid 
2-aminohexanedioic acid 



L2~diaminoethane; 
1 ,3-diaminopropane 

1.2- diarninoethane; 

1 .3- diam inopropane 



In embodimentswherein the side chain amide bond-forming substituent of each flanking residue is 
an amino substituent, the peptide can be conveniently cyclized with a dicarboxyiic acid linker. In one 
example, the flanking residues and the dicarboxyiic acid linker are selected according to Table 2 below. It 
will be appreciated that each of the flanking residues and linker molecules listed in Table 2 below is 
considered to represent not only the particular molecule corresponding to the given chemical name under 
IUPAC rules, but also any variant of the molecule containing additional substituents or modified substitueiits 
which do not prevent or substantially alter the functioning of the amino and/or carboxy groups contained in 
the molecule, which functioning is necessary for use of the molecule in the. methods of the invention. 
Accordingly, each molecule listed will be understood to encompasses variant molecules containing alkenyl, 
alkynyl and other unsaturated bonds, heteroatoms, cycloalkyl substituents, aromatic substituents, or other 
substituents in the carbon backbone of the molecule, and/or variants containing the foregoing or other 
substituents or groups in place of hydrogen atoms on the carbon backbone of the molecule. 

Table 2 



Item 
No. 



Flanking 
Residue#l 



3 2,3-diaminopropanoic acid 

30 



4 2,3-diaminopropanoic acid 



35 5 2,3-diaminopropanoic acid 



6 2,3-diaminopropanoic acid 



7 2,3-diaminopropanoic acid 



Flanking 
Residue#2 



1 2,3-diaminopropanoic acid 2,3-diaminopropanoic acid 

2 2,3-diaminopropanoic acid 2,4-diaminobutanoic acid 



2,5-diaminopentanoic acid 



lysine 



2,7-diaminoheptanoic acid 



2,8-dtaminooctanoic acid 



2,9-diaminononanoic acid 



Dicarboxyiic acid 
Linker 



heptanedioic acid; 
octanedioic acid; 
nonanedioic acid 

hexanedioic acid; 
heptanedioic acid; 
octanedioic acid 

pentanedioic acid; 
hexanedioic acid; 
heptanedioic acid 

butanedioic acid; 
pentanedioic acid; 
hexanedioic acid 

propanediol acid; 
butanedioic acid; 
pentanedioic acid 

ethanedioic acid; 
propanediol acid; 
butanedioic acid 

ethanedioic acid; 
propanediol acid 
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8 2,4-diaminobutanoic acid 



9 2,4-diaminobutanoic acid 



10 2,4-diaminobutanoic acid 



10 1 1 2 T 4-diaminobutanoic acid 



12 2,4*diaminobutanoic acid 



15 13 2,5 -diarninopentanoic acid 



20 



25 



30 



35 



14 2,5 -diarninopentanoic acid 



1 5 2,5-diaminopentanoic acid 



16 lysine 



2y4-diaminobutanoic acid 



2 > 5-diaminopentanoic acid 



lysine 



2,7-diaminoheptanoic acid 



2,8-diaminooctanoic acid 



2 t 5-diaminopentanoic acid 



lysine 



2,7-djaminoheptanojc acid 



lysine 



pentanedioic acid; 
hexanedioic acid; 
heptanedioic acid 

butanedioic acid; 
pentanedioic acid; 
hexanedioic acid 

propanedioic acid; 
butanedioic acid; 
pentanedioic acid 

ethanedioic acid; 
propanedioic acid; 
butanedioic acid 

ethanedioic acid; 
propanedioic acid 

propanedioic acid; 
butanedioic acid; 
pentanedioic acid 

ethanedioic acid; 
propanedioic acid; 
butanedioic acid 

ethanedioic acid; 
propanedioic acid 

ethanedioic acid; 
propanedioic acid 



In embodiments using an amino substituentas the side chain amide bond-forming substituent of one 
flanking residue and a carboxy substituent as the side chain amide bond-forming substituent of the other 
flanking residue, the peptide can be conveniently cyclized with an ammo-substituted carboxylic acid 
(aminocarboxylicacid) linker. In one example, the flanking residues and the aminocarboxylic acid linker are 
selected according to Table 3 below. It wilt be appreciated that each of the flanking residues and linker 
molecules listed in Table 3 below is considered to represent not only the particular molecule corresponding 
to the given chemical name under 1UPAC rules, but also any variant of the molecule containing additional 
substituentsor modified substituents which do not prevent or substantially alter the functioning of the amino 
and/or carboxy groups contained in the molecule, which functioning is necessary for use of the molecule in 
the methods of the invention. Accordingly, each molecule listed will be understood to encompasses variant 
molecules containing alkenyl, alkynyl and other unsaturated bonds, heteroatoms, cycloalkyl substituents, 
aromatic substituents, or other substituents in the carbon backbone of the molecule, and/or variants containing 
the foregoing or other substituents or groups in place of hydrogen atoms on the carbon backbone of the 
molecule. 
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Table 3 



Item Flanking 
No. Residue#l 



Flanking 
Residue#2 



Aminocarboxylic acid 
Linker 



10 



1 aminopropanedioic acid 



2 aminopropanedioic acid 



3 aminopropanedioic acid 



15 4 aminopropanedioic acid 



20 



25 



30 



35 11 



40 



13 



14 



45 



5 aminopropanedioic acid 



6 aminopropanedioic acid 



7 aminopropanedioic acid 



8 aspartic acid 



aspartic acid 



10 aspartic acid 



aspartic acid 



12 aspartic acid 



aspartic acid 
glutamic acid 



2.3- diarnkiopropanoic acid 

2.4- diaminobutanoic acid 

2.5- diaminopentanoic acid 

2.6- diaminohexanoic acid 

2.7- diaminoheptanoic acid 

2.8- diaminooctanoic acid 

2.9- diaminononanoic acid 
23-diaminopropanoic acid 

2.4- diaminobutanoic acid 

2.5- diaminopentanoic acid 

2.6- diaminohexanoic acid 

2.7- diaminoheptanoic acid 

2.8- diaminooctanotc acid 
2,3-diarninopropanoic acid 
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7- aminoheptanoic acid; 

8- aminooctanoic acid; 

9- aminononanoic acid 

6- arainohexanoic acid; 

7- arninoheptanoic acid; 

8- aminooctanoic acid 

5- aminopentanoic acid; 

6- aminohexanoic acid; 

7- aminoheptanoic acid 

4- aminobutanoic acid; 

5- aminopentanoic acid; 

6- aminohexanoic acid 

3- aminopropanoic acid; 

4- aminobutanoic acid; 

5- aminopentanoic acid 

aminoethanoic acid; 

3- aminopropanoic acid; 

4- aminobutanoic acid 

aminoethanoic acid; 
3-aminopropanoic acid 

6- aminohexanoic acid; 

7- aminoheptanoic acid; 

8- aminooctanoic acid 



5- aminopentanoic acid; 

6- aminohexanoic acid; 

7- aminoheptanoic acid 

4- aminobutanoic acid; 

5- aminopentanoic acid; 

6- aminohexanoic acid 

3- aminopropanoic acid; 

4- aminobutanoic acid; 

5- aminopentanoic acid 

aminoethanoic acid; 

3- aminopropanoic acid; 

4- aminobutanoic acid 

aminoethanoic acid; 
3-aminopropanoic acid 

5- aminoheptanoic acid; 

6- aminohexanoic acid; 

7- aminoheptanoic acid 
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15 glutamic acid 

16 glutamic acid 

5 

17. glutamic acid 

10 18 glutamic acid 

19 2-aminohexanedioic acid 

15 20 2-aminohexanedioic acid 

21 2-aminohexanedioic acid 

20 

22 2-aminohexanedioic acid 

23 2-aminoheptanedioic acid 



25 



30 



35 



24 2-aminoheptanedioic acid 



25 2-aminoheptanedioic acid 



26 2-aminooctanedioic acid 



27 2-aminooctanedioic acid 



28 2-aminononanedioic acid 



2.4- diaminobutanoic acid 

2.5- diaminopentanoic acid 

2.6- diaminohexanoic acid 

2.7- diaminoheptanoic acid 
2,3-diaminopropanoic acid 

2 f 4-diaminobutanoic acid 

2,5-diaminopentanoic acid 

lysine 

2.3- diaminopropanoic acid 

2.4- diaminobutanoic acid 

2.5- diaminopentanoic acid 
2,3-diaminopropanoic acid 

2 t 4-diaminobutanoic acid 
2,3-diaminopropanoic acid 
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4- aminobutanoic acid; 

5- aminoheptanoic acid; 

6- aminobexanoic acid 

3- arnmopropanoic acid; 

4- aminobutanoic acid; 

5- aminoheptanoic acid 

aminoethanoic acid; 

3- aminopropanoic acid; 

4- aminobutanoic acid 

aminoethanoic acid; 

3- aminopropanoic acid 

4- aminobutanoic acid; 

5- aminoheptanoic acid; 

6- aminohexanoic acid 

3- aminopropanoic acid; 

4- aminobutanoic acid; 

5- aminoheptanoic acid 

aminoethanoic acid; 

3- aminopropanoic acid; 

4- aminobutanoic acid 

aminoethanoic acid; 
3-aminopropanoic acid 

3- aminopropanoic acid; 

4- aminobutanoic acid; 

5- aminpheptanoic acid 

aminoethanoic acid; 

3- aminopropanoic acid; 

4- aminobutanoic acid 

aminoethanoic acid; 
3-aminopropanoic acid; 

aminoethanoic acid; 

3- aminopropanoic acid; 

4- aminobutanoic acid 

aminoethanoic acid; 
3-aminopropanoic acid 

aminoethanoic acid; 
3-aminopropanoic acid 



(ii) Cyclization Methods 

40 Once the flanking residues and difUnctional linker have been selected and the peptide chain spanning 

the flanking residues has been synthesized on solid phase, the difunctional linker can be used to cyclize the 
solid phase-boundpeptide by any convenient method. It will be appreciated that the invention encompasses 
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metnods of cyclizing a peptide after the finished peptide chain is fully synthesized, and methods of cyclizing 
the peptide at any point during peptide synthesis in which the peptide chain contains the flanking residues that 
are to be cross linked by the diftmctionai linker. Methods for cyclizing the peptide include (1) deprotecting 
the side chain amide bond-forming substituents of the flanking residues and reacting the solid phase peptide 
5 with the diftmctionai linker to simultaneously form amide linkages between the two functional groups of the 
linker and the side chain amide bond-forming substituents of both flanking residues; (2) deprotecting the side 
chain amide bond^formingsubstituentof oniy one of the flanking residues (without deprotecting the side chain 
amidebond^formingsubstituentof the other flanking residue), reacting the Afunctional linker with the solid 
phase peptide to form an amide linkage between one functional group on the linker and the side chain amide 
1 0 bond-forming substituent of the deprotected flanking residue, deprotecting the side chain amide bond-forming 
substituent of the other flanking residue, and then mtramolecularly reacting the free functional group on the 
linker and the side chain amide bond-forming substituent of the other flanking residue, thereby cyclizing the 
peptidkand (3) deprotectingthe side chain amide bond-forming substituents of both of the flanking residues, 
obtaining a monoprotected diftmctionai linker wherein only one of the linker's two amide bond-forming 
1 5 functional groups is capable of reacting with a counterpart side chain amide bond-forming substituent in a 
flanking residue, reacting the-monoprotected, dirunctional linker with the solid phase peptide to form an amide 
linkage between the free functional group on the linker and the side chain amide bond-forming substituent of 
one of* the deprotected flanking residues, deprotecting the blocked functional group on the linker, and then 
intramolecularly reacting the free functional group on the linker and the side chain amide bond-forming 
20 substituent of the other flanking residue, thereby cyclizing the peptide. The orthogonal deprotection reactions, 
non-orthogonal deprotection reactions, and amide bond formation reactions can be performed as described 
in Section (B)(H)(l)(a) above. 

'! In implementing the methods of the invention generally described as methods (2) and (3) above, it 
is desjrableto use synthesis schemes that exploit the advantages of orthogonal protection and deprotection of 
25 functional groups to avoid formation of unwanted derivatives. It will be evident to the practitioner from the 
follo^ingrepresentativesyntheticschemes that the protecting groups for the side chain amide bond-forming 
substituents of the flanking ! residues, the method of peptide synthesis used, and the sequence of peptide 
cycliiationreactions can be selected such that each of these components of the synthetic scheme increases the 
specificity of the reactions and improves yield of the desired product. 
30 (iii) Cyclization Using Diamine Linkers 

In an exampteusing carboxy substituents for the side chain amide bond-forming substituents of both 
flanking residues, a diamine linker for cyclization, and Fraoc chemistry for peptide synthesis, the carboxy 
substituentsare orthogonally protected with respect to each other and with respect to the Fmoc-protected cc- 
amino group of the N-terminal residue in the peptide chain by using an allyl group to protect the carboxy 
3 5 substituent of one flanking residue and a t-buty 1 ester to protect the carboxy substituent of the other flanking 
residk in this example, the peptide can be cyclized by (1) using reduction to deprotect the allyl-protected 
carboxy substituent of one flanking residue (without deprotecting the t-butyt ester-protected carboxy 
substituent of the other flanking residue); (2) reactingan unprotectedor monoprotected (e,g. allyloxycarbonyl- 
or Boc-monoprotected)diamine linker with the solid phase peptide to form an amide linkage between one of 
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the linker's amino groups and the deprotectedcarboxy substituent; (3) using acidolysis to deprotect the t-butyi 
ester-protectedcarboxysubstituentofthe other flanking residue and deprotect the Boc-protected amino group 
of the linker if a Boc-monoprotecteddiamine linker is used as the linker; (4) using reduction to deprotect the 
ally loxycarbony 1-protected amino group of the linker if an ally loxycarbony 1-rnonoprotected diamine linker 
5 is used as the linker; and (5) intramolecuJarlyreactingthe tree carboxy substituent of the other flanking residue 
with the free amino group of the linker to form an amide linkage that cyclizes the peptide. 

Alternatively, the peptide can be cyclized by (1) using acidolysis to deprotect the t-bulyf ester- 
protected carboxy substituent of one flanking residue (without deprotecting the allyl-protected carboxy 
substituent of the other flanking residue); (2) reacting an unprotectedormonoprotected(e,g. allyloxycarbonyi- 
1 0 or Boc-monoprotected)diamine linker with the solid phase peptide to form an amide linkage between one of 
the linker's amino groups and the deprotected carboxy substituent; (3) using reduction to deprotect the allyl- 
protected carboxy substituentof the other flanking residue and deprotectthe altyloxycarbonyl-protectedamino 
group of the linker if an allyioxycarbonyl-monoprotected diamine linker is used as the linker; (4) using 
acidolysis to deprotect the Boc-protected amino group of the linker if a Boc-monoprotected diamine linker 
15 is used as the linker; and (5) intramolecuiarly reacting the free carboxy substituentof the other flanking residue 
with the free amino group of the linker to form an amide linkage that cyclizes the peptide. 

In an example using carboxy substituents for the side chain amide bond-forming substituents of both 
flanking residues, a diamine linker for cyclization, and Boc chemistry for peptide synthesis, the carboxy 
substituents are orthogonally protected with respect to each other and with respect to the Boc-protected a- 
20 amino group of the N-terminal residue in the peptide chain by using an allyl group to protect the carboxy 
substituentof one flanking residue and a fluorenylmethyl (Fm) ester to protect the carboxy substituent of the 
other flanking residue. In this example, the peptide can be cyclized by (1) using reduction to deprotectthe 
allyl-protected carboxy substituent of one flanking residue (without deprotecting the Fm ester-protected 
carboxy substituent of the other flanking residue); (2) reacting an unprotected or monoprotected (e.g. 
25 allyloxycarbonyl- or Fmoc-monoprotected) diamine linker with the solid phase peptide to form an amide 
linkage between one of the linker's amino groups and the deprotected carboxy substituent; (3) using base 
saponification to deprotect the Fm ester-protected carboxy substituent of the other flanking residue and 
deprotect the Fmoc-protected amino group of the linker if a Fmoc-monoprotected diamine linker is used as 
the linker; (4) using reduction to deprotect the allyloxycarbonyl-protected amino group of the linger if an 
30 allyioxycarbonyl-monoprotected diamine linker is used as the linker; and (5) intramolecuiarlyreactingthe free 
carboxy substituent of the other flanking residue with the free amino group of the linker to form an amide 
linkage that cyclizes the peptide. 

Alternatively, the peptide can be cyclized by (1) using base saponification to deprotect the Fm ester- 
protected carboxy substituent of one flanking residue (without deprotecting the allyl-protected carboxy 
35 substituent of the other flanking residue); (2) reacting an unprotectedor monoprotected (e.g. allyloxycarbonyl- 
or Fmoc : monoprotected) diamine linker with the solid phase peptide to form an amide linkage between one 
of the linker's amino groups and the deprotectedcarboxy substituent; (3) using reduction to deprotectthe allyl- 
protected carboxy substituentof the other flanking residue and deprotectthe ally loxycarbony 1-protectedamino 
group of the linkerif an allyioxycarbonyl-monoprotected diamine linker is used as the linker; (4) using base 
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saponification to deprotect the Fmoc-protected amino group of the linker if a Frnoc-monoprotected diamine 
linker is used as the l inker; and (5) intrarnoiecularly reacting the free carboxy substituent of the other flanking 
residue with the free amino group of the linker to form an amide linkage that cyclizes the peptide. 

(iv) Cyciization Using Dicarboxylic Acid Linkers 
5 In an example using amino substituents for the side chain amide bond-forming substituents of both 

flanking residues, a dicarboxylic acid linker for cyciization, and Fmoc chemistry for peptide synthesis, the 
amino substituents are orthogonally protected with respect to each other and with respect to the Fmoc- 
protected ct-amino group of the N-terminai residue in the peptide chain by using an ailyioxycarbonyl group 
to protect the amino substituent of one flanking residue and a Boc group to protect the amino substituent of 

1 0 the other flanking residue . In this example, the peptide can be cyciized by ( 1 ) using reduction to deprotect the 
allyioxycarbonyi-protectedamino substituent of one flanking residue (without deprotecting the Boc-protected 
amino substituent of the other flanking residue); (2) reacting an unprotected or monoprotected (e.g. allyl- or 
t-butyl ester-monoprotected) dicarboxylic acid linker with the solid phase peptide to form an amide linkage 
between one of the linker's carboxy groups and the deprotected amino substituent; (3) using acidolysis to 

1 5 deprotect the Boc-protectedamino substituent of the other flanking residue, and to deprotect the t-buty 1 ester- 
protected carboxy group of the linker if a t-buty I ester-monoprotected dicarboxylic acid linker is used as the 
linker; (4) using reduction to deprotect the aiiyl-protected carboxy group of the linker if an allyi- 
monoprotected dicarboxylic acid linker is used as the linker; and (5) inrramolecularly reacting the free amino 
substituent of the other flanking residue with the free carboxy group of the linker to form an amide linkage 

20 that cychzes the peptide. 

Alternative iy, the peptide can be cyciized by (1) using acidolysis to deprotect the Boc-protected 
amino substituent of one flanking residue (without deprotecting the allyloxycarbonyl-protected amino 
substituent of the other flanking residue); (2) reacting an unprotected or monoprotected (e.g. ally]- or t-butyl 
ester-monoprotected)dicarboxylicacid linker with the solid phase peptide to form an amide linkage between 

25 one of the linker' s carboxy groups and the deprotected amino substituent; (3) using reduction to deprotect the 
allyloxycarbonyl-protected amino substituentof the other flankingresidue, and to deprotectthe ally 1 -protected 
carboxy group of the linker if an allyl-monoprotected dicarboxylic acid linker is used as the linker; (4) using 
acidolysisto deprotectthe t-butyl ester-protectedcarboxy group of the linker if a t-butyl ester-monoprotected 
dicarboxylicacid linker is used as the linker; and (5) intrarnoiecularly reacting the free amino substituent of 

30 the other flanking res idue with the free carboxy group of the linker to form an amide linkage that cyclizes the 

' peptide, 

In an example using amino substituents for the side chain amide bond-forming substituents of both 
flanking residues, a dicarboxylic acid linker for cyciization, and Boc chemistry for peptide synthesis, the 
amino substituents are orthogonallyprotected with respectto each other and with respect to the Boc-protected 
35 a-amino group of the N-terminai residue in the peptide chain by using an ailyioxycarbonyl group to protect 
the amino substituent of one flanking residue and a Fmoc group to protect the amino substituent of the other 
flanking residue. In this example, the peptide can be cyciized by (1) using reduction to deprotect the 
allyloxycarbonyl-protected amino substituent of one flanking residue (without deprotecting the Fmoc- 
protected amino substituent of the other flanking residue); (2) reacting an unprotected or monoprotected (e.g. 
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ally!- or Fm ester-monoprotected) dicarboxyiic acid linker with the solid phase peptide to form an amide 
linkage between one of the linker's carboxy groups and the deprotected amino substituent; (3) using base 
saponification to deprotect the Fmooprotected amino substituent of the other flanking residue, and to 
deprotect the Fm ester-protected carboxy group of the linker if a Fm ester-monoprotected dicarboxyiic acid 
5 linker is used as the linker; (4) using reduction to deprotect the allyt-protected carboxy group of the linker if 
an allyl-monoprotecteddicarboxylicacid linker is used as the linker; and (5) intramolecularly reacting the free 
ammo substituent of the other flanking residue with the free carboxy group of the linker to form an amide 
linkage that cyciizes the peptide. 

Alternatively, the peptide can be cycHzed by (1) using base saponification to deprotect the Fmoc- 
10 protected amino substituent of one flanking residue (without deprotecting the allyloxycarbonyl-protected 
amino substituent of the other flanking residue); (2) reacting an unprotected or monoprotected (e.g. allyl- or 
Fm ester-monoprotected) dicarboxyiic acid linker with the solid phase peptide to form an amide linkage 
between one of the linker's carboxy groups and the deprotected amino substituent; (3) using reduction to 
deprotectthe allyloxycarbonyl-protectedamino substituent of the other flanking residue, and to deprotect the 
1 5 ally I-protected carboxy group of the linker if an aliyl-monoprotected dicarboxyiic acid linker is used as the 
linker; (4) using base saponification to deprotectthe Fm ester-protectedcarboxy group of the linker if a Fmoc- 
monoprotecteddicarboxylicacid linker is used as the linker; and (5) intramolecularly reacting the free amino 
substituent of the other flanking residue with the free carboxy group of the linker to form an amide linkage 
that cyclizes the peptide. 
20 (v) Cyclization Using Aminocarboxylic Acid Linkers 

In an example using an amino substituent for the side chain amide bond-forming substituent of one 
flanking residue, a carboxy substituent for the side chain amide bond-formingsubstituentof the other flanking 
residue, an aminocarboxylic acid linker for cyclization, and Fmoc chemistry for peptide synthesis, the side 
chain amide bond-forming substituents of the flanking residues are orthogonally protected with respect to each 
25 other and with respectto the Fmoc-protecteda-amino group of the N-terminal residue in the peptide chain by 
using an allyloxycarbonyl group to protect the amino substituent of one flanking residue and a t-butyl ester 
to protect the carboxy substituent of the other flanking residue. In this example, the peptide can be cyclized 
by (1) using reduction to deprotect the allyloxycarbonyl-protected amino substituent of one flanking residue 
(without deprotectingthe t-butyl ester-protectedcarboxy substituent of the other flanking residue); (2) reacting 
30 an unprotected or amino-protected (e.g. allyloxycarbonyl-protected amino or Boc-protected amino) 
aminocarboxylic acid linker with the solid phase peptide to form an amide linkage between the linker's 
carboxy group and the deprotected amino substituent; (3) using acidolysis to deprotect the t-butyl ester- 
protectedcarboxy substituent of the other flanking residue, and to deprotectthe Boc-protected amino group 
of the linker if an aminocarboxylic acid with a Boc-protected amino group is used as the linker; (4) using 
35 reduction to deprotect the allyloxycarbonyl-protected amino group of the linker if an aminocarboxylic acid 
with an allyloxycarbonyl-protected amino group is used as the linker; and (5) intramolecularly reacting the 
free carboxy substituent of the other flanking residue and the free amino group of the aminocarboxylic acid 
linker to cyciize the peptide. 



-102- 



BNSDCXIID: <WO ©920036A1_L_> 



WO 98/20036 



PCT/US97/20069 



Alternatively* the peptide can be cyclized by (1) using acidoiysis to deprotect the t-butyl ester- 
protected carboxy substituent of one flanking residue (without deprotecting the allyloxycarbonyi-protected 
ammo substituent of the other flanking residue); (2) reacting an unprotected or carboxy-protected (e.g. allyl- 
or t-butyl ester-protectedcarboxy)aminocarboxy lie acid linker with the solid phase peptide to form an amide 
5 linkage between the linker's amino group and the deprotected carboxy substituent; (3) using reduction to 
deprotect the allyloxycarbonyl-protectedamino substituent of the other flanking residue, and to deprotect the 
altyl-protectedcarboxy group of the linker if an aminocarboxylic acid with an aliyl-protected carboxy group 
is used as the linker; (4) using acidoiysis to deprotect the t-butyl ester-protected carboxy group of the linker 
if an aminocarboxylic acid with a t-butyl ester-protected carboxy group is used as the linker; and (5) 
10 intrarnolecularly reacting the free amino substituent of the other flanking residue and the free carboxy group 
of the aminocarboxylic acid linker to cyclize the peptide. 

In another example using an amino substituent for the side chain amide bond-forming substituent of 
one flanking residue, a carboxy substituent for the side chain amide bond-forming substituent of the other 
flanking residue, an aminocarboxylic acid linker for cyclization, and Fmoc chemistry for peptide synthesis, 
1 5 the side chain amide bond-formingsubstituentsof the flanking residues are orthogonally protected with respect 
to each other and with respect to the Fmoc-protected a-amino group of the N-terminal residue in the peptide 
chain by using a Boc group to protect the amines substituent of one flanking residue and an ally! group to 
protectthe carboxy substituentof the other flanking residue. In this example, the peptide can be cyclized by 
(1) using acidoiysis to deprotect the Boc-protected amino substituent of one flanking residue (without 
20 deprotecting the allyl-protectedcarboxy substituentof the other flanking residue); (2) reacting an unprotected 
or amino-protected (e.g. allyloxycarbonyi-protected amino or Boc-protected amino) aminocarboxylic acid 
linker with the solid phase peptide to form an amide linkage between the linker's carboxy group and the 
deprotected amino substituent; (3) using reduction to deprotect the allyl-protected carboxy substituent of the 
other flanking residue, and to deprotect me allyloxycarbonyi-protected amino group of the linker if an 
25 aminocarboxy licacid with a ally ioxy carbony i-protectedamino group is used; (4) using acidoiysis to deprotect 
the Boc-protected amino group of the linker if an aminocarboxylic acid with an Boc-protected amino group 
is used as the linker; and (5) inframolecularlyreacting the .free carboxy substituent of the other flanking residue 

and the free amino group of the aminocarboxylic acid linker to cyclize the peptide. 

Alternatively, the peptide can be cyclized by (1) using acidoiysis to deprotect the Boc-protected 
30 amino substituentof one flanking residue (without deprotecting the allyl-protected carboxy substituent of the 
other flanking residue); (2) reacting an unprotected or amino-protected (e.g. allyloxycarbonyi-protected or 
Boc-protected amino) aminocarboxylic acid linker with the solid phase peptide to form an amide linkage 
between the linker's carboxy group and the deprotected amino substituent; (3) using reduction to deprotect 
the allyi-protected carboxy substituent of the other flanking residue, and to deprotect the ally Ioxy carbony 1- 
35 protected amino group of the linker if an aminocarboxylic acid with an allyloxycarbonyi-protected amino 
group is used as the linker; (4) using acidoiysis to deprotect the Boc-protected amino group of the linker if an 
aminocarboxylic acid with a Boc-protectedamino group is used as the linker; and (5) intramolecularlyreacting 
the free carboxy substituentof the other flanking residue and the free amino group of the aminocarboxylic acid 
linker to cyclize the peptide. 
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In an example using an amino substituent for the side chain amide bond-forming substituent of one 
flanking residue, a carboxy substituent for the side chain amide bond-forming substituent of the other flanking 
residue, an arninocarboxyiicacid linker for cyclization,and Boc chemistry for peptide synthesis, the side chain 
amide bond-forming subs'tituentsof the flanking residues are orthogonallyprotectedwithrespectto each other 
5 and with respect to the Boc-protected a-amino group of the N-terminal residue in the peptide chain by using 
an allyloxycarbonyl group to protect the amino substituent of one flanking residue and a Fm ester to protect 
the carboxy substituent of the other flanking residue. In this example, the peptide can be cyclized by (1 ) using 
reduction to deprotect the allyloxycarbonyl-protected amino substituent of one flanking residue (without 
deprotecting the Fm ester-protected carboxy substituent of the other flanking residue); (2) reacting an 
10 unprotected or ammo-protected (e,g, allyloxycarbonyl-protected amino or Fmoc-protected amino) 
aminocarboxylic acid linker with the solid phase peptide to form an amide linkage between the linker's 
carboxy group and the deprotectedamino substituent; (3) using base saponification to deprotect the Fm ester- 
protected carboxy substituent of the other flanking residue, and to deprotect the Fmoc-protected amino group 
of the linker if an aminocarboxylic acid with a Fmoc-protected amino group is used as the linker; (4) using 
15 reduction to deprotect the allyloxycarbonyl-protected amino group of the linker if an aminocarboxylic acid 
with an allyloxycarbonyl-protected amino group is used as the linker; and (5) intramolecularly reacting the 
free carboxy substituent of the other flanking residue and the free amino group of the aminocarboxylic acid 
linker to cyclize the peptide. 

Alternatively, the peptide can be cyclizedby (1) using base saponification to deprotect the Fm ester- 
20 protected carboxy substituent of one flanking residue {without deprotecting the allyloxycarbonyl-protected 
amino substituent of the other flanking residue); (2) reacting an unprotected or carboxy-protected (e.g. aliyl- 
or Fm ester-protected carboxy) aminocarboxylic acid linker with the solid phase peptide to form an amide 
linkage between the linker's amino group and the deprotected carboxy substituent; (3) using reduction to 
deprotect the allyloxycarbonyl-protectedamino substituent of the other flanking residue, and to deprotect the 
25 allyl-protectedcarboxy group of the linker if an aminocarboxylic acid with an allyi-protected carboxy group 
is used as the linker; (4) using base saponification to deprotect the Fm ester-protected carboxy group of the 
linker if an aminocarboxylic acid with a Fm ester-protected carboxy group is used as the tinker; and (5) 
intramolecularlyreactingthe free amino substituent of the other flanking residue and the free carboxy group 
of the aminocarboxylic acid linker to cyclize the peptide. 
7 30 In another example using an amino substituent for the side chain amide bond-forming substituent of 

one flanking residue, a carboxy substituent for the side chain amide bond-forming substituent of the other 
flankingresidue, an arninocarboxyiicacid linker for cyclization, and Boc chemistry for peptide synthesis, the 
side chain amide bond-forming substituents of the flanking residues are orthogonally protected with respect 
to each other and with respect to the Boc-protected a-amino group of the N-terminal residue in the peptide 
35 chain by using a Fmoc group to protect the amino substituent of one flanking residue and an ally! group to 
protectthe carboxy substituent of the other flanking residue. In this example, the peptide can be cyclized by 
(1) using base saponification to deprotect the Fmoc-protected amino substituent of one flanking residue 
(without deprotecting the allyl-protected carboxy substituent of the other flanking residue); (2) reacting an 
unprotected or amino-protected (e.g. allyloxycarbonyl-protected amino or Fmoc-protected amino) 
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aminocarboxylic acid linker with the solid phase peptide to form an amide linkage between the linker's 
carboxy group and the deprotected amino substituent; (3) using reduction to deprotect the allyl-protected 
carboxy substituent of the other flanking residue, and to deprotect the allyloxycarbonyl-protectedamino group 
of the linker if an aminocarboxylicacid with a allyloxycarbonyl-protectedarnino group is used; (4) using base 
5 saponiflcationto deprotectthe Fmoc-protected amino group of the linker if an aminocarboxylic acid with an 
Fmoc-protected amino group is used as the linker; and (5) intramolecularly reacting the free carboxy 
substituent of the other flanking residue and the free amino group of the aminocarboxylicacid linker to cyciize 
the peptide. 

Alternatively, the peptide can be cyclized by (1) using reduction to deprotect the allyl-protected 
10 carboxy substituent of one flanking residue (without deprotecting the Fmoc-protected amino substituent of 
the other flanking residue); (2) reacting an unprotected or carboxy-protected (e.g. allyl-protected or Fm ester- 
protected carboxy) aminocarboxylicacid linker with the solid phase peptide to form an amide linkage between 
the linker's amino group and the deprotected carboxy substituent; (3) using base saponification to deprotect 
the Fmoc-protected amino substituent of the other flanking residue, and to deprotect the Fm ester-protected 
1 5 carboxy group of the linker if an aminocarboxylicacid with a Fm ester-protectedcarboxy group is used as the 
linker; (4) using reduction to deprotect the allyl-protected carboxy group of the linker if an aminocarboxylic 
acid with an allyl-protected carboxy group is used as the linker; and (5) intramolecularly reacting the free 

amino substituent of the other flanking residue and the free carboxy group of the aminocarboxylic acid linker 

to cyciize the peptide, 

20 In yet another embodiment using an amino substituent for the side chain amide bond-forming 

substituent of one flanking residue, a carboxy substituent for the side chain amide bond-forming substituent 
of the other flanking residue, an aminocarboxylicacid linker for cyclization, and Fmoc chemistry for peptide 
synthesis, the regioselectivity of the cyclization procedure is provided by orthogonally protecting the side 
chain amide bond-forming substituents of the flanking residues with respect to the Fmoc-protected ce-amino 

.25 group of the N-terminal residue in the peptide chain but not with respect to each other, and orthogonally 
protecting one of the aminocarboxylic acid linker's functional groups with respect to the Fmoc-protected a- 
amino group of the N-term.inai residue in the peptide chain. 

] n an exampleof the foregoing embodiment using an allyloxycarbonyl-protected amino substituent 
as the side chain amide bond-forming substituent of one flanking residue, an allyl-protected carboxy 

30 substituent as the side chain amide-bond forming substituent of the other flanking residue, a monoprotected 
aminocarboxylic acid linker, and Fmoc chemistry for peptide synthesis, the peptide can be cyclized by (1) 
using reduction to orthogonally deprotect the side chain amide bond-forming substituents of the flanking 
residues (without deprotecting the Fmoc-protected a-amino group of the N-terminal residue in the peptide 
chain); (2) reacting a carboxy-protected(e.g, ally 1- or t-butyl ester protected carboxy) or amino-protected (e.g. 
35 allyloxycarbonyl-or Boc-protectedamino) aminocarboxylic acid linker with the solid phase peptide to form 

an amide linkage between the unprotected functional group of the linker and the corresponding side chain 

amide bond-forming substituent on one of the flanking residues; (3) using reduction or acidolysis, as 
appropriate, to deprotectthe protected functional group of the linker; and (4) intramolecularly reacting the free 



-105- 



WO 98/20036 



PCT/US97/20069 



side chain amide bond-forming substituent of the other flanking residue and the free functional group of the 
linker to cyciize the peptide.. 

In an example of the foregoing embodiment using a Boc-protectedamino substituentas the side chain 
amide bond-forming substituent of one flanking residue, a t-butyl ester-protected carboxy substituent as the 
5 side chain amide-bond forming substituent of the other flanking residue, a monoprotected aminocarboxyiic 
acid linker, and Fmoc chemistry for peptide synthesis, the peptide can be cyclized by (1) using acidolysis to 
orthogonally deprotect the side chain amide bond-forming substituents of the flanking residues (without 
deprotectingthe Fmoc-protected ce-amino group of the N-terminal residue in the peptide chain); (2) reacting 
a carboxy-protected(e,g. allyl or t-buty! ester-protected carboxy) or amino-protected (e.g. allyloxycarbonyl- 

10 or Boc-protected amino) aminocarboxyiic acid linker with the soiid phase peptide to form an amide linkage 
between the unprotected functional group of the linker and the corresponding side chain amide bond-forming 
substituenton one of the flanking residues; (3) using reduction or acidolysis, as appropriate, to deprotect the 
protected functional group of the linker; and (4) intramolecularly reacting the free side chain amide bond- 
forming substituent of the other flanking residue and the free functional group of the linker to cyclize the 

15 peptide. 

In still another embodiment using an amino substituent for the side chain amide bond-forming 
substituent of one flanking residue, a carboxy substituent for the side chain amide bond-forming substituent 
of the other flanking residue, an aminocarboxyiic acid linker for cyciization, and Boc chemistry for peptide 
synthesis, the regioselectivity of the cyciization procedure is provided by orthogonally protecting the side 

20 chain amide bond-forming substituents of the flanking residues with respect to the Boc-protected oe-amino 
group of the N-terminal residue in the peptide chain but not with respect to each other, and orthogonally 
protecting one of the aminocarboxyiic acid linker's functional groups with respect to the Boc-protected a- 
amino group of the N-terminal residue in the peptide chain. 

In an example of the foregoing embodiment using an allyloxycarbbnyl-protected amino substituent 

25 as the side chain amide bond-forming substituent of one flanking residue, an allyl-protected carboxy 
substituent as the side chain amide-bond forming substituent of the other flanking residue, an aminocarboxyiic 
acid linker, and Boc chemistry for peptide synthesis, the peptide can be cyclized by (1) using reduction to 
orthogonally deprotect the side chain amide bond-forming substituents of the flanking residues (without 
deprotectingthe Boc-protected a-amino group of the N-terminal residue in the peptide chain); (2) reacting a 

30 carboxy-protected (e.g. allyl- or Fm ester-protected carboxy) or amino-protected (e.g. allyloxycarbonyl- or 
Fmoc-protected amino) aminocarboxyiic acid linker with the solid phase peptide to form an amide linkage 
between the unprotected functional group of the linker and the corresponding side chain amide bond-forming 
substituent on one of the flanking residues; (3) using reduction or base saponification, as appropriate, to 
deprotect the protected functional group of the linker; and (4) intramolecularly reacting the free side chain 

35 amide bond-forming substituent of the other flanking residue and the free functional group of the linker to 
cyclize the peptide. 

In an example of the foregoing embodiment using a Fmoc-protected amino substituent as the side 
chain amide bond-form ing substituent of one flanking residue, a Fm ester-protectedcarboxy substituent as the 
side chain amide-bond forming substituent of the other flanking residue, an aminocarboxyiic acid linker, and 
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-Boc chemistry for peptide synthesis, the peptide can be cyclized by (1) using base saponification to 
orthogonally deprotect the side chain amide bond-forming substituents of the flanking residues (without 
deprotectingtbe Boc-protectedct-amino group of the N-terminal residue in the peptide chain); (2) reacting a 
carboxy-protected (e,g. allyl- or Fm ester-protected carboxy) or amino-protected (e.g. allyloxycarbonyl- or 
5 Fmoc-protected amino) aminocarboxylic acid linker with the solid phase peptide to form an amide linkage 
between the unprotected functional group of the linker and the corresponding side chain amide bond-forming 
substituent on one of the flanking residues; (3) using reduction or base saponification, as appropriate, to 
deprotect the protected functional group of the linker; and (4) intramolecularly reacting the free side chain 
amide bond-forming substituent of the other flanking residue and the free functional group of the linker to 

10 cyclize the peptide. 

Following cyciization, the helix-constrained peptide is optionally cleaved away from the solid 
support, recovered and purified. The peptide can be removed from the solid support by a reagent capable of 
disrupting the peptide-solid phase link, and optionally deprotecting blocked side chain functional groups on 
the peptide. In one embodiment, the peptide is cleaved away from the solid phase by acidolysis with liquid 

1 5 hydrofluoricacid (HF), which also removes any remaining side chain protective groups. Preferably, in order 
to avoid alkylation of residues in the peptide (for example, alkyiation of methionine, cysteine, and tyrosine 
residues); the acidolysis reaction mixture contains thio-creso I and cresol scavengers. Following HF cleavage, 
the resin is washed with ether, and the free peptide is extracted from the resin with sequential washes of acetic 
acid solutions. The combined washes are lyophiiized, and the residue is purified. 

20 c. Liquid Phase Cyciization 

Alternatively, the peptide can be cleaved away from the solid support prior to the cyciization step. 
In one embodiment, after the difunctional linker is coupled to the side chain amide bond-forming substituent 
of the first flanking residue in the peptide, the peptide is cleaved away from the solid support. The peptide 
is recovered, deblocked at the side chain amide bond-forming substituent of the second flanking residue (if 

25 necessary), and then cyclized at low concentration in a reaction mixture to maximize intramolecular amide 
bond formation. Typically^ a maximum level of intramolecularamide bond formation can be achieved under 
conditions in which the concentration of the peptide provides an intramoiecularconcentration of free carboxy 
and amino groups that exceeds the intermolecular concentration of free carboxy and amino groups in the 
reaction mixture. In one embodiment, a peptide concentration of 1 nM to 1 M, and preferably I u>M to 1 mM, 

30 and more preferably i uM to 100 uM, is used to maximize cyciization. The cyciization of free peptide can 
be conducted with any of the amino acid coupling reactions used to helicize peptide bound to a solid support 
described above. 

d. Synthetic Schemes 

In one embodiment, any helix constrained compound of formulas ( 1 ), (1 a), ( 1 b), ( 1 c), ( 1 d), (le),(lf), 
35 and (1 g) is made by utilizing (in peptide synthesis as described in Section (B)(II)(l)(a) above) the particular 
combinationof flanking residues and diamine linker shown in Table 1 above that provides the values of n, m 
and p characterizing the compound of interest, and cyclizing the resulting peptide according to the methods 
described in Section (B)(1I)( 1 )(b)(H) or (iii) above. For example, any compound of formulas (1), (la), (i b), 
( lc)> (Id), (le), (1 fK and (lg) characterizedby m=0, p=0, and n=7, 8, or 9 can be made by utilizing (in peptide 
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synthesis as described in Section (BXH)(l)(a) above) the flanking residues and any diamine linkeriisted in 
Item No, 1 in Table 1 above, and cyclizingthe resulting peptide according to the methods described in Section 
(B)(II)( 1 )(b)(ii)or (iii) above. In another example, any compound of formulas ( I ), ( 1 a), ( i b), ( 1 c), ( 1 d), ( 1 e), 
(I f), and (lg) characterized by m=0, p=6 t and n=2 or 3, or characterized by m-6, p=0, and n-2 or 3, can be 

5 made by utilizing (in peptide synthesis.as described in Section (B)(ll)(l)(a) above) the flanking residues and 
any diamine linker listed in Item No. 7 in Table 1 above, and cyclizing the resulting peptide according to the 
methods described in Section (B)(II)(l)(b)(ii) or (iii) above. In yet another example, any compound of 
formulas (1), (la), (lb), (lc), (Id), (le), (If), and (lg) characterized by m=3, p=3, and n-2 or 3, can be made 
by utilizing (in peptide synthesis as described in Section (B)(Il)(l)(a) above) the flanking residues and any 

1 0 diamine linker listed in Item No. 16 in Table I above, and cyclizing the resulting peptide according to the 
methods described in Section (B)(II)(1 )(b)(ii) or (iii) above. 

In anotherembodiment, any helix constrained compound of formulas (2), (2a), (2b), (2c), (2d), (2e), 
(2f), (2g), (3), (3a), (3b), (3c), (3d),(3e), (3f), and (3g) is made by utilizing (in peptide synthesis as described 
in Section (B)(II)(l)(a) above) the flanking residues and any diamine linker listed in Item No. 9 in Table 1 

15 above, and cyclizing the resulting peptide according to the methods described in Section (B)(II)( 1 )(b)(n) or 
(iii) above. 

In another embodiment, any helix constrained compound of formulas (4), (4a), (4b), (4c), (4d), (4e), . 
(4f), and (4g) is made by utilizing (in peptide synthesis as described in Section (B)(II)(l)(a) above) the 
flanking residues and any diamine linker listed in Item No, 13 in Table 1 above, and cyclizing the resulting 

20 peptide according to the methods described in Section (B)(II)( 1 )(b)(ii) or (iii) above- 
In another embodiment, any helix constrained compound of formulas (5), (5a), (5b), (5c), (5d), (5e), 
(5f), and (5g) is made by utilizing (in peptide synthesis as described in Section (B)(H)(l)(a) above) the 
flanking residues and any diamine linker listed in Item No. 8 in Table 1 above, and cyclizing the resulting 
peptide according to the methods described in Section (B)(I1)( 1)0)00 or (iii) above. 

25 In anomer embodiment, any helix constrained compound of formulas (6), (6a), (6b), (6c), (6d), (6e), 

(6f), (6g), (1 1), (Ma), (1 1 b), (1 1c), (1 Id), (1 le), (1 If), and (1 lg) is made by utilizing (in peptide synthesis as 
described in Section (B)(II)(l)(a) above) the particular combination of flanking residuesand aminocarboxylic 
acid linker shown in Table 3 above that provides the values of q, r and s characterizing the compound of 
interest, or the values oft, u and v characterizing the compound of interest, as appropriate, and cyclizing the 

30 resultingpeptide according to the methods described in Section (B)(II)(l)(b)(ii) or (v) above. For example, 
any compound of formulas (6), (6a), (6b) t (6c), (6d), (6e), (6f), (6g), (1 1), (I la), (lib), (I lc), (1 Id), (1 le), 
( 1 If), and (1 1 g) characterized by q-1, s=0, and r=6, 7, or 8, or characterized by t=0, v=l , and u=6, 7, or S, 
as appropriate, can be made by utilizing (in peptide synthesis as described in Section (B)(ll)(l)(a) above) the 
flanking residues and any aminocarboxylicacid linker listed in Item No. \ in Table 3 above, and cyclizing the 

3 5 resulting peptide according to the methods described in Section (B)(I1)( 1 )(b)( ii) or (v) above. 

In another example, any compoundof formulas (6), (6a), (6b), (6c), (6d), (6e), (6f), (6g), (11), (11 a), 
(lib), (11c), (I id), (1 le), (1 If), and (1 lg) characterized by q-1, s=6, and r= lor 2, or characterized by t-6, 
v=l, and u=l or 2, as appropriate, can be made by utilizing (in peptide synthesis as described in Section 
; (B)(II)(I)(a)above) the flanking residues and any aminocarboxylic acid linker listed in Item No. 28 in Table 
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3 above, and cydizingfhe resulting peptide accordingto the methods described in Section (B)(II)(l)(b)(ii) or 

(v) above. .< . • . . 

In another examplei any compound of formulas (6), (6a.) y (6b), (6c), (6d), (6e), (6f) ? (6g), ( 11), (1 la), 
(lib), (lie), (lid), (lie), <llf),and (llg) characterized, by qs=7, s^O, and r^ lor 2, or characterized by t=0, 
5 V=7, ahdu=l or 2, as appropriate, can be made by utilizing (in peptide synthesis as described in Section 
(B)(tt)(l Xa) above) the flanking residues and any aminocarboxyiic acid linker listed in Item No. 7 in Table 
3above, and cyclizingthe resulting peptides accordingto the methods describe in Se^on (B^l)0>Xii) or 

(V) above. ■ • ,•..--...-.?•.••-...>•.*.•. . ; , . v.- ... . :. .■ . . • . . V. . . ,-. ..-t ; > 

In ahbmet examplevany compound of formulas (6> (6», (6b), (6c), (6d), (6e), (6f). (6g), (11), (1 la), 
10 (1 lb), (ltc^tnd),^^), (llf), ; and(l lg> characterizedby q=3, s^, and r==lor 2, or characterized by.t=4, 
v=3, and u=l or 2, as appropriate, can be made by utilizing (in peptide synthesis as descried in Section 
(B)(TI)(l)(a)abbve)theflankingresiduesand any aminocarbpxylic acid linker listed, in Item No. 25 in Table 
3 above', arid cyclizing the resulting peptide according to the methods described in Section (B)(H)(l)(b)(ii)or 

(v) above. • ■.-■.-.■,.<^ 

15 In another embodiment, any helix constrainedcomppund of formulas (7), (7a), (7b), (7c), (7d), (7e), 

(7f),{7g),(13),(13a),(13b),(13c),(13d),(13e),(13f),and(13g) is made by utilizing (in peptide synthesis as 
described in Section (BXI 1)0)0*) aboye> the:»flanking residues and any ammpcarboxylic acid linker listed in 
Item No. : 14 in Table 3 above; and cyclizing the resulting peptide: according to the methods described in 
Section (B)(II)(l)(b)(ii) or (v) fcbove. .• . , lV 

20 In another embodiment, any helix constrained compound of formulas (8), (8a), (8b), (8c), (8d), (8e), 

(8f)H8;gM*^ 

descfibedih Section (BXII)(l)(a) above) the flankmg residues and; any aminocarboxyiic acid linker listed in 
ftem Nb: 9 in Table 3 above, and cyclizingtheresultingpeptide aeeprdmgto the njethods ascribed in Section 

' (B)(Il)(l)(bXii) br (v) above, j • . »*!. k-,^- • : : ,,, ..v ..,,„■:, 

25 i In anomerembodimeht,any helix constrainedcompound of formulas (?), (9a), (9b), (9c), (9d), (9e), 

(9f), (9g), (15), (I5a),(15b),(15c), (15d),(15e),({5f) < and(15g) is made by utilizing (in peptide synthesis as 
describjed in Section (B)(II)( 10(a) above) the flaking residues and any anjinocarboxylic acid linker listed in 
• ftem No: 15 in Table 3 above, and cyclizing the resulting peptide according to the methods described in 
Sectibn(B)(IlXl)(b)('0or(v)aboye.' - r 
30 ; \n another embodiment,any helix constrainedcompoundoffo^ 

(lOeV(lOf), (lOg* (14), (14a), (14b), (14e); (14d), 04e), (14f), and (I4g) is made by . utilizing (in peptide 
synthesis as described in Section (.B)(II)(l)(a) above): the flankmg residues and any aminocarboxyiic acid 
linker listed hi item Nov 8 in Table 3 above, and cyclizing die resulting peptide according to the methods 

described in Section (B)^ ,. •:•£•.- ■ 

35 - in one embodiment, any helix constrained compound of formulas (16), (16a), (16b), (16c), (16d), 

(i6e)^i6f),ahd('16g)ismadeby utilizing (in pesptide synthesis, as described hi Section (B)(H)(l)(a) above) 
' the particular combination of flanking residues and rdicarboxy He acid linker shown in Table 2 above that 
prdviSesthe value* of W, x and y characterizmgwe comppurid of interest, and cyclizing the resulting peptide 
accordingto the methodsdescribedin Section (B)(HXl)(b)(ii>pr(iv). above, For example, any compound of 
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formulas (16), (16a), (16b), (16c), (16d), (16e),.(16f), and (16g) characterized by w-1, y»l, andx=5, 6, or 
7 can be made by utilizing (in peptide synthesis as described in Section (B)(II)(l)(a) above) the flanking 
residues and any dicarboxylic acid linker listed in Item No. 1 in Table 2 above, and cyciizing the resulting 
peptide according to the methods described in Section (B)(II)( I )(b)(ii)or (iv) above. In another example, any 
5 compoundof forrnu]as(16), (16a), (16b), (16c),(16d),(16e), (I6f), and (i6g) characterized by w= I T y=7, and 
x=0 or 1, or characterized by w=7, y-1, and x=0 or 1, can be made by utilizing (in peptide synthesis as 
described in Section (B)(II)( 1 )(a)above) the flanking residues and any dicarboxy lie acid linker listed in Item 
No. 7 in Table 2 above, and cyciizing the resulting peptide according to the methods described in Section 
(B){II)(l)(b)(ii) or (iv) above, in yet another example, any compound of formulas (16), ( 1 6a), (16b), (16c), 
10 (1 6d), ( 1 6e), (16f), and (1 6g) characterized by w-4, y=4, and x-0 or i , can be made by utilizing (in peptide 
synthesis as described in Section (B)(li)(l)(a) above) the flanking residues and any dicarboxylic acid linker 
listed in Item No, 16 in Table 2 above, and cyciizing the resulting peptide according to the methods described 
in Section (B)(Il)(l)(b)(u) or (iv) above. 

In another embodiment, any helix constrained compound of formulas ( 1 7), ( 1 7a), ( 1 7b), ( 1 7c), { 1 7d), 
15 (1 7e), ( 1 7f), ( 1 7g), ( 1 S), ( 1 8a), ( 1 8b), ( 1 8c), ( 1 8d), (1 8e), ( 1 8f), and ( 1 8g) is made by utilizing (in peptide 
synthesis as described in Section (B)(H)(l)(a) above) the flanking residues and any dicarboxylic acid linker 
listed in Item No. 2 in Table 2 above, and cyciizing the resulting peptide according to the methods described 
in Section (B)(I1)(1 )(b)(ii) or (iv) above. 

In another embodiment, any helix constrained compound of formulas ( 1 9), ( 1 9a), ( 1 9b) ? ( 1 9c) , ( 1 9d) a 
20 (1 9e), ( i 9f), and ( 1 9g) is made by utilizing (in peptide synthesis as described in Section (B)(H)( 1 )(a) above) 
the flanking residues and any dicarboxylic acid linker listed in Item No. 1 in Table2 above, and cyciizing the 
resulting peptide according to the methods described in Section (B)(II)(l)(b)(ii) or (iv) above. 

In another embodiment, any helix constrained compound of formulas (20), (20a), (20b), (20c), (20d), 
(20e), (20f), and (20g) is made by utilizing (in peptide synthesis as described in Section (B)(H)( 1 )(a) above) 
25 the flanking residues and any dicarboxylic acid linker listed in Item No. 8 in Table 2 above, and cyciizing the 
resulting peptide according to the methods described in Section (B)(II)(l)(b)(ii) or (iv) above. 

(2) Synthesis of Linear Peptide with Difunctional Linker-Coupled Flanking Amino Acid 
The peptide is designed such that the sequence to be helicized comprises an amino acid sequence that 
is six residues in length that extends between flanking residues as described in Section (B)(II)(l)(a) above. 
; 30 The peptide can be constructed using a modification of the solid phase synthesis methods described in Section 
(B)(II)(l)(a) above wherein one of the flanking residues is coupled to a difunctional linker before addition to 
the peptide chain. This aHows me lmker to be incorporated into the peptide as part of a standard amino acid 
The flanking residue can be coupled to the difunctional linker by any convenient means. Typically, 
the side chain am ide bond-forming substituent of the flanking residue is used to form an amide linkage with 
35 one of the functional groups on the linker. In one embodiment designed for use in conjunction with Fmoc 
chemistry, the linker-derivatizedflanking residue is created by obtaining from a commercial source an amino 
acid residue with an Fmoc-protecteda-ammosubstiment,at-butyl ester-protected a-carboxv substituent, and 
an unprotected side chain amino substituent, and then reacting the a-substituent protected amino acid with a 
difunctional linker having a free carboxy group to form an amide linkage between the linker's free carboxy. 
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group and the unprotected side chain amino substituent of the amino acid using any of the condensation 
methods described in Section (B)(IIX0(a) above. The t-butyl ester-protected a-carboxy substituent of the 
derivatized amino acid residue is then removed by acidolysisto permit incorporation of the derivatized amino 

acid into the peptide chain. - 
5 In another embodiment designed for use in conjunction with Fmoc chemistry, the linker-derivatized 

flanking residue is created by obtaining from a commercial source an amino acid residue with an Fmoc- 
protected a-amino substituent, an ally i-protected a-carboxy substituent, and an unprotected side chain amino 
substituent, and then reacting the a-substituent protected amino acid with a difunctional linker having a free 
carboxy group to form an amide linkage between the linker's free carboxy group and the unprotected side 
10 chain amino substituent of the amino acid using any of the condensation methods described in Section 
(B)(II)( l)(a) above. The allyl-protected a-carboxy substituent of the derivatized amino acid residue is then 
removed by reduction to permit incorporation of the derivatized amino acid into the peptide chain. 

In one embodiment designed for use in conjunction with Boc chemistry, the linker-derivatized 
flanking residue is created by obtaining from a commercial source an amino acid residue with an Boc- 
15 protected a-amino substituent, a Fm ester-protected a-carboxy substituent, and an unprotected side chain 
amino substituent, and then reactingthe a-substituent protected amino acid with a difunctional linker having 
a free carboxy group to form an amide linkage between, the linker' s free carboxy group and the unprotected 
side chain amino substituent of the amino acid using any of the condensation methods described in Section 
(B)(1I)( 1 ) (a) above. The Fm ester-protected a-carboxy substituent of the derivatized amino acid residue is 
20 then removed by base saponification to permit incorporation of the derivatized amino acid into the peptide 
chain. 

. . m another embodiment designed for use-in conjunction with Boc chemistry, the linker-derivatized 
flanking residue is created by obtaining from a commercial source an amino acid residue with an Boc- 
protected a-amino substituent,an allyl-protected «-carboxy substituent, and an unprotected side chain amino 
25 substitueniand then reacting the a-substituent protected amino acid with a difunctional linker having a free 
carboxy group to form an amide linkage between the linker's free carboxy group and the unprotected side 
chain amino substituent of the amino acid using any-of the condensation methods, described in Section 
(BXIIKlXa) above. The allyl-protected a-carboxy substituent of the derivatized amino acidresidue isthen 
removed by reduction to permit incorporation of the derivatized amino acid into the peptide chain. 
30 In one embodiment designed for use in conjunction with Fmoc chemistry, the linker-derivatized 

flanking residue is created by obtaining from a commercial source an amino acid residue with an Fmoc- 
• . protected a-amino substiment,a t-butyl ester-protected a-carboxy substituent, and an unprotected side chain 
carboxy substituent, and then reactingthe a-substituent protected amino acid with a difunctional linker having 
a free amino group to form an amide linkage between the linker's free ammo group and me unprotected side 
35 chain carboxy substituent of the amino acid using any of the condensation methods described in Section 
(BXilXlXa) above: The t-butyl ester-protected a-carboxy substituent of the derivatized amino acid residue 
isthen removed by acidolysisto pennit incorporation of the derivatized amino acid into the peptide chain. 

In another embodiment designed for use in conjunction with Fmoc chemistry, the linker-derivatized 
flanking residue is created by obtaining from a commercial source an amino acid residue with an Fmoc- 
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protected a-amino substituent, an allyl-protecteda-carboxy substituent, and an unprotectedside chain carboxy 
substituent.and then reacting the a-substituent protected amino acid with a difiinctiona! linker having a free 
amino group to form an amide linkage between the linker's free amino group and the unprotected side chain 
carboxy substituentof the amino acid using any of the condensation methods described in Section (BXIDOX*) 

5 above. The allyl-protected a-carboxy substituent of the derivatized amino acid residue is then removed by 
reduction to permit incorporation of the derivatized amino acid into the peptide chain. 

In one embodiment designed for use in conjunction with Boc chemistry, the linker-derivatized 
flanking residue is created by obtaining from a commercial source an amino acid residue with an Boc- 
protected a-amino substituent, a Fm ester-protected a-carboxy substituent, and an unprotected side chain 
10 carboxy substituent, and then reacting the ct-substituentprotected amino acid witha difunctional linker having 
a free amino group to form an amide linkage between the linker's free amino group and the unprotected side 
chain carboxy substituent of the amino acid using any of the condensation methods described in Section 
(B)(II)(1 )(a) above. The Fm ester-protected a-carboxy substituent of the derivatized amino acid residue is 
then removed by base saponification to permit incorporation of the derivatized amino acid into the peptide 

15 chain. 

In another embodiment designed for use in conjunction with Boc chemistry, the linker-derivatized 
flanking residue is created by obtaining from a commercial source an amino acid residue with an Boc- 
' protected a-amino substituent,anal^ 

substituent,and then reacting the a-substituent protected amino acid with a difunctional linker having a free 
20 amino group to form an amide linkage between the linker's free amino group and the unprotected side chain 
carboxy substituentof the amino acid using any of the condensationmethods described in Section (B)(II)(i)(a) 
above. The allyl-protected a-carboxy substituent of the derivatized amino acid residue is then removed by 
reduction to permit incorporation of the derivatized amino acid into the peptide chain. 

It is desirableto protect one of the functional groups on the difunctional linker either before the linker 
25 is coupled to the flanking residue that is selected to carry the linker or after the coupling but before the addition 
of the linker-coupled flanking residue to the peptide chain. This improves yield by avoiding unwanted 
reaction of the free functional group on the flanking residue-coupledlinker during peptide synthesis. The free 
functional group on the linker can be blocked with any of the amino or carboxy protective groups described 
in Section (B)(IIXl)(a) above. In one embodiment, the free functional group on the linker and the a-amino 
30 groups are orthogonally protected such that the a-amino groups can be deprotected in peptide synthesis 
without deprotectingthe free functional group on the linker. It will be appreciated that any of the foregoing 
procedures for coupling difunctional linkers to flanking residues can be easily modified to derivatize a 
particular flanking residue with a selected orthogonally monoprotected difunctional linker. 

In one embodiment designed for use in conjunction with Fmoc chemistry, an orthogonally 
3 5 monoprotected difunctional linker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Fmoc-protecteda-amino substituent, a t-butyl ester-protecteda-carboxy 
substituent, and an unprotected side chain amino substituent, and then reacting the a-substituent protected 
amino acid with a difunctional linker carryinga free carboxy group and either an allyl-protectedcarboxy group 
or an allytoxycarbonyl-protected amino group to form an amide linkage between the linker's free carboxy 
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group and the unprotected side chain amino substituent of the amino acid using any of the condensation 
methods described in Section (B)(H){l)(a) above. The t-butyl ester-protected a-carboxy substituent of the 
derivatizedamino acid residue is then removed by acidolysistopennftmcorporationofthe derivatizedammo 
acid into the peptide chain. 

5 In another embodiment designed for use in conjunction with Fmoc chemistry, an orthogonally 

monoprotected afunctional Iinker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Fmoc-protected a-amino substituent, an aliyl-protected a-carboxy 
substituent, and an unprotected side chain amino substituent, and then reacting the a-subsntuent protected 
amino acid with a Afunctional linker carrying a free carboxy group and either a Boc-protected amino group 

10 ora t-butylester-protectedcarboxy group to form an amide linkage between the linker's free carboxy group 
and the unprotected side chain amino substituent of the amino acid using any of the condensation methods 
described in Section (B)(II)(l)(a)above. The allyl-protected a-carboxy substituent of the derivatized amino 
acid residue is then removed by reduction to permit incorporation of the derivatized amino acid into the 
peptide chain. 

15 In one embodiment designed for use in conjunction with Boc chemistry, an orthogonally 

monoprotected difunctional iinker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residuewith an Boc-protected a-amino substituent, a Fm ester-protected a-carboxy 
substituent, and an unprotected side chain amino substituent, and then reacting the a-substituent protected 
amino acid with a difonctional linker carrying a free carboxy group and either an allyloxycarbonyl-protected 

20 amino group or an allyl-protectedcarboxy group to form an amide linkage between the linker's free carboxy 
group and the unprotected side chain amino substituent of the amino acid using any of the condensation 
methods described in Section (BXIIKl Xa) above. The Fm ester-protected a-carboxy substituent of the 
derivatized amino acid residue is then removed by base saponification to permit incorporation of the 
derivatized amino acid into the peptide chain. 

25 In another embodiment designed for use in conjunction with Boc chemistry, an orthogonally 

monoprotected difunctional Iinker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Boc-protected a-amino substituent, an allyi-protected a-carboxy 
substituent, and an unprotected side chain amino substituent, and then reacting the a-substituent protected 
amino acid with a difunctional linker carrying a free carboxy group and either a Fmoc-protected amino group 
30 or a Fm ester-protectedcarboxy group to form an amide linkage between the linker's free carboxy group and 
the unprotected side chain amino substituent of the amino acid using any of the condensation methods 
desmbedm Section <BXIIXlXa)above. The allyl-protected a-carboxy-substituentofthe derivatized amino 
acid residue is then removed by reduction to permit incorporation of the derivatized amino acid into the 
peptide chain. 

35 In one embodiment designed for use in conjunction with Fmoc chemistry, an orthogonally 

monoprotected difunctional Iinker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Fmoc-protecteda-amino substituent, a t-butyl ester-protected a-carboxy 
s ubstituent,and an unprotected side chain carboxy substituent, and then reacting the a-substituent protected 
amino acid with a difunctional linker carrying a free amino group and either an allyloxycarbonyl-protected 
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amino group or an ailyl-protected carboxy group to form an amide linkage between the linker's free amino 
group and the unprotected side chain carboxy substituent of the amino acid using any of the condensation 
methods described in Section (B)(D)(l)(a) above, -The t-butyl ester-protected a-carboxy substituent of the 
derivatizedarnino acid residue is then rernoved-by acidoiyststo permit incorporation of the derivatized amino 

5 acid into the peptide chain- 
In another embodiment designed for use in conjunction with Fmoc chemistry, an orthogonally 
monoprotected difunctional linker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Fmoc-protected ct-amino substituent, an ailyl-protected a-carboxy 
substituent, and an unprotected side chain carboxy substituent, and then reacting the oc-substituent protected 

1 0 amino acid with a difunctional linker carrying a free amino group and either a Boc-protected amino group or 
a t-butyi ester-protected carboxy group to form an amide linkage between the linker's free amino group and 
the unprotected side chain carboxy substituent of the amino acid using any of the condensation methods 
described in Section (B)(II)(1 )(a) above. The ailyl-protected a-carboxy substituent of the derivatized amino 
acid residue is then removed by reduction to permit incorporation of the derivatized amino acid into the 

1 5 peptide chain. 

In one embodiment designed for use in conjunction with Boc chemistry, an orthogonally 
monoprotected difunctional linker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Boc-protected ct-amino substituent, a Fm ester-protected a-carboxy 
substituent, and an unprotected side chain carboxy substituent, and then reacting the a-substituent protected 

20 amino acid with a difunctional linker carrying a free amino group and either an allyloxycarbonyi-protected 
amino group or an atlyl-protected carboxy group to form an amide linkage between the linker's free amino 
group and the unprotected side chain carboxy substituent of the amino acid using any of the condensation 
methods described in Section (B)(H){l)(a) above. The Fm ester-protected a-carboxy substituent of the 
derivatized amino acid residue is then removed by base saponification to permit incorporation of the 

25 derivatized amino acid into the peptide chain. 

In another embodiment designed for use in conjunction with Boc chemistry, an orthogonally 
monoprotected difunctional linker-derivatized flanking residue is created by obtaining from a commercial 
source an amino acid residue with an Boc-protected ct-amino substituent, an ailyl-protected a-carboxy 
substituent, and an unprotected side chain carboxy substituent, and then reacting the a-substituent protected 

30 amino acid with a difunctional linker carrying a free amino group and either a Fmoc-protected amino group 
or a Fm ester-protected carboxy group to form an amide linkage between the linker's free amino group and 
the unprotected side chain carboxy substituent of the amino acid using any of the condensation methods 
described in Section (B)(H)( I )(a) above. The ailyl-protected a-carboxy substituent of the derivatized amino 
acid residue is then removed by reduction to permit incorporation of the derivatized amino acid into the 

35 peptide chain. 

In another aspect* the foregoing embodiments utilizing a difunctional linker-derivatized flanking 
residue can be modified by orthogonally protecting the side chain amide bond-forming substituent of the 
underivatized(notpre-coupledto difunctional linker) flanking residue with respect to the ct-amino protection 
chemistry used in peptide synthesis and with respect to any or all of the amide bond-forming substituents 
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found in the side chains of other amino acid residues in the peptide. In this aspect, the side chain amide bond- 
forming su bstituent of the underivatized, flanking residue can be selectively deblocked, yielding a peptide that 
can be cyclizedby a condensation reaction that is specifically targeted to be between the deprotected side chain 
amide bond-forming substituent of the underivatized, flanking residue and the free mnctional group of the 

5 Afunctional linker. Suitable methods for orthogonal protection of side chain amide bond-forming substituents 
are described in Section (B)(II)(lXa) above. 

Following completion of solid phase peptide synthesis, the peptide can be cyclized by a coupling 
reaction between the free functional group of the difunctional linker and the side chain amide bond-forming 
substituent of the underivatized, flanking residue as described in Section (B)(II)(l)(b) above. Any blocking 

10 group(s) protecting the underivatized, flanking residue's side chain amide bond-forming substituent and/or 
the free functional group of the difunctional linker is (are) removed, and the deprotected groups are coupled 
to form an amide linkage using any of the condensation methods described in Section (B)(II)(l)(a) above. 
Optionally, the resulting cyclized (constrained helix) peptide is cleaved away from the solid support recovered 
and purified. 

15 Alternatively, the peptide can be cleaved away from the solid support prior to the cyclization step. 

In one embodiment, after synthesis of the linear peptide chain is complete, the peptide is cleaved away from 
the solid support. The peptide is recovered, deblocked at the side chain amide bond-forming substituent of 
the underivatized, flanking residue and/or the free functional group of the difunctional linker, and then 
cyclized at low concentration in a reaction mixture in order to maximize intramolecuiaramide bond formation. 

20 Typically, a maximum level of intramolecular amide bond formation can be achieved under conditions in 
which the concentrationof the peptide provides an intramolecular concentration of free amide bond-forming 
substituents or groups that exceeds the intermolecular concentration of free amide bond-forming substituents 
or groups in the reaction mixture. In one embodiment, a peptide concentrationof 1 nM to 1 M f and preferably 
l uM to ImM, and more preferably I u.M to 1 00 \iM, is used to maximize cyclization. The cyclization of free 

25 peptide can be conducted with any of the condensation reactions used to heiicize solid phase peptide described 

. . above. . ■ " : 

III. Methods for Constructing Semisynthetic Locked Helix Proteins 

Also provided herein are semisynthetic proteins comprising locked helix peptides attached onto or 
incorporated in between one or more larger, recombinantly synthesized protein molecules. The semisynthetic, 

30 locked helix peptides of the invention can be made by any convenient method, including ligationof the locked 
helix peptides synthesized as described in Section (B)(II) above to one or more recombinantly synthesized 
protein sequences. For example,- protein ligases such as the lt subtiligases ,, can be : used to concatenate the 
locked helix peptides made as described herein to larger, recombinantly synthesized protein fragments. 

In one embodiment, the methods of the invention are modified in order to produce a locked helix 

35 peptide that functions as "first ligation substrate'* in the subtiiigase catalyzed peptide ligation methods 
described in international Patent Application No. PCT/US 9 1/05480 (WO 92/02615 published 20 February 
1992) or as "donor ester'\ "donor peptide", and "P n ...P 4 -P 3 -P 2 -P r glc-F-amide ester", respectively, in the 
subtiiigase catalyzed peptide ligation methods described ui Abrahmsenet ah, Biochem., 2Q: 4151-4 1 59(1991), 
Jackson et aL. Science. 2££ 243-247 (1994), and Chang et al., Proc. Natl. Acad. Sci. USA, 21f 12544-12548 
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(1994). The locked helix peptide can be synthesized such that the C-terminal amino acid residue of the 
cyclized peptide is in an ester linkage with the 2-hydroxyI group of a 2-hydroxycarboxylic acid, such as 
glycolic acid or iactic acid, to form a leaving group favored by the particular subtiligase of interest, i.e. such 
that the 2-hydroxycarboxylic acid ester, shown as the X residue of the "first ligation substrate" in Fig. 2B of 
5 WO 92/02615 t resemblesthe first residue positioned on the N-terminal side of the hydroiyzabie amide bond 
in the normal peptide substrate of subtilisin, shown as residue P{ of the "hydrolysis substrate" in Fig. 2B. 

In another embodiment, the leaving group comprises a 2-hydroxycarboxylic acid and another amino 
acid residue, shown as the R 2 " residue of the first ligation substrate in Fig, 2B of WO 92/026 15, wherein the 
carboxy group of the 2-hydroxycarboxylicacid residue is in an amide linkage with the a-amino group of the 
10 additional amino acid residue. In such embodiments, the amino acid residue in the leaving group can be 
selected to resemble the second residue positioned on the N-terminal side of the hydroiyzabie amide bond in 
the normal peptide substrate of subtiHsin, shown as residue P 2 ' of the "hydrolysis substrate" in Fig. 2B of WO 
92/02615. In a preferred embodiment, the leaving group is a giycolate-phenylalanyl (glc-F) moiety such as 
the glycolate-phenylalanyl-amide (glc-F-NH 2 ) moiety described in Example 2 of WO 92/02615. 
1 5 in one aspect, the glc-F leaving group is placed in its proper position at the C-terminus of the locked 

helix peptide by obtaining a Boc- or Fmoc-a-amino protected phenylalanine, linking the a-amino protected 
phenylalanine to solid phase resin with an ct-carboxy ester or amide linkage, deprotecting the protected a~ 
amino group, adding a glycolic acid residue in the form of a t-butyi ether to form an amide linkage between 
the carboxy group of the glycolic acid and the free a-amino group of the solid phase phenylalanine, removing 
20 the t-butyl ether group from the glycolic acid residue with acid and forming an ester linkage between the free 
hydroxyl of the glycolic acid residue and the a-carboxy of the next amino acid residue in the C-terminal 
sequence desired for the locked helix peptide. Subsequent amino acids can be added and the resulting peptide 
can be helicized according to any of the above described methods which utilize standard Boc or Fmoc 
chemistry for peptide synthesis. In one embodiment, a g!c-F-NH 2 leaving group is incorporated into the 
25 desired peptide chain essentially as described in Example 2 of WO 92/02615 or as described in Jackson et bL, 
Science . 266 : 243-247 (1994). 

In yet another embodiment, the "donor peptide" includes a flexible linker sequence between the C- 
terminal residue of the locked helix peptide sequence and the leaving group sequence, such as a di- or tri- 
glycine linker, to promote flexibility and accessibility of the donor peptide's leaving group to subtiligase. 
30 After the donor peptide (with the helix locking tether in place) is obtained, a subtiligase can be used 

to iigate a peptide or protein fragment (produced by recombinant or other synthetic methods), designated the 
"second ligation substrate" in Fig, 2C of WO 92/02615, the "acceptor peptide" in Fig. 1 on page 244 of 
Jackson et ah, Science . 266: 243-247 (1994), and the "Nucleophile" peptide in the synthetic scheme on page 
12545 of Chang et a!., &££ Nart Acad. Sci. USA . 9k 12544-12548 (1994), to the C-terminus of the donor 
35 peptide by displacement of the leaving group according to any of the subtiligase-catalyzed peptide ligation 
methods described above. In embodiments using acceptor peptides or proteins having a relatively inaccessible 
N-terminus due to higher order protein structure, ligation efficiency can be improved by altering the design 
of the acceptor peptide to incorporate a flexible linker sequence, such as a di- or tri-glycine sequence, at the 
N-terminus to promote flexibility and accessibility of the acceptor peptide N-terminus in the peptide ligation 
7 
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reaction. Alternatively, the accessibility of the acceptor peptide N-terminus and/or donor peptide C-terminus 
to subtil igase can be improved by conducting the ligation reaction under denaturing conditions which 
eliminates unfavorable structural conformations that may be assumed by the peptide substrates. In such 
embodiments, it is preferable to use a denaturation-stable subtiligase, such as the "stabiligase" described in 
5 Chang et al. t supra {capable of retaining nearly 50% of catalytic activity in 4 M guanidine hydrochloride). 
It will be appreciated that additional peptides can be synthesized with a suitable leaving group at the 
C-terminus and successively ligated to the N-terminus of the semisynthetic peptide containing the locked helix 
moiety by repeating the foregoing procedures until a completed peptide with the desired N-terminus is 
obtained. 

!0 in the event that the completed, semisynthetic, locked helix protein is obtained in a denatured, 

incorrectly folded, or otherwise inactive form as a result of the synthetic procedures used, the inactive species 
can be refolded into the native or active conformation by renaturation techniques that are well known in the 
art. Typical renaturation procedures use a chaotrope, such as urea at high pH or guanidine hydrochloride, to 
unfold inactive material followed by dilution of the denaturant to permit refolding to occur, while preventing 

15 the formation of random disulfide bonds prior to the assumption of the biologically active conformation 
throughnon-covalent,intramolecuiar interactions (see, U.S. Pat. Nos. 4,512,922; 4,518,256; 4,51 1,502; and 
4,511,503). Reversed micelles or ion exchange chromatography are used to assist refolding of denatured 
proteins by enclosing a single protein molecule within micelles or isolating proteins on a resin and then 
removing the d^^^ 966-975 (1990); Creighton in Protein Structure , 

20 Fading and Design . Oxender, D.L., ed., Alan R. Liss, Inc. (New York: 1985), pp. 249-251. In addition, 
conformation-specific refolding can be performed with Hgands and antibodies to the native structure of the 
(Cleland ™A w*ng> in Riotechnologv. Rehm» H.-J., and Reed, G.. eds, VCH (New York), pp. 528-555. 
Smce they are more likely to interact with the protein in its native conformation, these binding molecules can 
be used to guide the folding reactions towards native state protein. The foregoing recovery methods are 

25 regarde d as being universally applicable, with minor modifications, to the recovery of biologically active 
recombinant proteins from inclusion bodies, and are equally applicable to the recovery of biologically active 

; IV. Methods for Constructing Macromolecule-Bound Locked Helix Peptides 

In one embodiment, the constrained, helical peptides of the invention bound to a macromolecular 

30 solid support can be obtained by constructing the locked helix peptides with the solid phase synthesis 
techniques described in Section (II) above and recovering the intact, solid support-peptide conjugate. 
Alternatively; the cyclized peptide can be cleaved away from solid phase following synthesis and then attached 
to the macromolecule of choice by any convenient method known in the art. For example, a commonly 
employed technique for attachingpeptide ligands to polysaccharidematrices, e.g. agarose, dextran or cellulose, 

35 involves activation of the carrier with cyanogen halides and subsequent coupling of the peptide's primary 
aliphatic or aromatic amines to the activated matrix. The activation of poly saccharides with cyanogen bromide 
(CNBr) at alkalinepH was introduced to affmity chromatography by Axen el al„ Nature, 2U\ 1302 (1967). 
In one aspect of the invention, the activation of polysaccharide matrices, particularly agarose matrices, is 
performedaccordingtothetitratidn-activation method. In this procedure, for example, 20 g of exhaustively 
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washed moist agarose cake is added to 20 ml of water in a 100 ml beaker equipped with a <H00°C 
thermometer, a pH meter and a 25 mm magnetic stirring bar, The suspension is stirred slowly, the temperature 
lowered to about 1(M5°C by the addition of crushed tee and the pH adjusted to 1Q.8±0. 1 by the addition of 
1 -2 drops of 4 N NaOH. The activation procedure is initiated by the addition of the CNBr and the pH of the 

5 reactionmaintainedat 10.S±0.1 by manual titration with the 4 N NaOH. The CNBr (100 g/mg moist weight 
gel) can be added as a crystalline solid, a crushed solid, an aqueous solution or by adding an aliquot of a stock 
solution. The latter can be prepared by dissolving CNBr in acetonitriie (1 g/ml) and storing in a tightly 
stoppered vial at -20°C. The temperature is subsequently allowed to rise to 1 8-20°C 

Despite the relative simplicity of the titration method, it may be preferable to use the faster and 

10 technically simplified method of March et aL, Anal Biochem., 60: 149 (1974). The activation procedure is 
performed in concentrated carbonate buffer. The required amount of washed gel is suspended in an equal 
volume of 2 M NaHC0 3 -NaC0 3 buffer (pH 10.9) In a beaker equipped with a thermometer and magnetic 
stirring bar. The slurry is cooled to approximately 4-5 °C, me activated gel is transfen-ed to a smtered funnel 
and washed. 

15 The concentration of CNBr recommended in the procedures described above is satisfactory for 

moderate levels of peptide substitution. When lower or higher levels of activation are required, 50 mg and 
200-300 mg CNBr/gmoist weight gel respectively can be employed together with 2 M and 8 M NaOH for the. 
titration. 

It is generally recognized that the CNBr-activated intermediate functional groups of polysaccharide 

20 gels display limited stability and therefore it is preferable that the gel be washed as rapidly as possible prior 
to transferring the gel to the coupling-reaction medium. At the end of the activation step, the gel is rapidly 
cooled by the addition of crushed ice and poured into a large sintered glass funnel which has been pre-cooled 
with crushed ice. The suspension is rapidly filtered into a Buchner flask (2 liter) containing solid ferrous 
sulfate to remove unreacted CNBr and cyanides as harmless ferrocyanide. The gel is subsequently washed 

25 under suction with 1 liter ice-cold distilled water and 1 liter of the buffer to be used in the coupling stage, 
typically ice-cold 0, 1 M NaHC0 3 -NaCO 3 buffer (pH 8.5-9.5). 

CNBr-activatedSepharose 4B is available commercially from Pharmacia and obviates the hazardous 
manipulation of CNBr, The activated gel is freeze dried in the presence of dextran and lactose to preserve the 
beaded form and supplied in 15 g air-tight packs. The required amount of freeze-dried powder is swollen in 

30 ImM HC1 on a glass filter and washed with at least 200 mi of the same solution per gram of powder. 1 g of 
freeze-dried material is roughly equivalent to 3,5 ml final gel volume. The peptide iigand-binding capacity 
of the gel is conserved more^effecttvely by washing with solutions of low pH than with solutionsofpH greater 
than 7. The gel is then ready to couple peptide ligand as soon as the washing is completed. 

Pharmacia also markets CNBr-activated Sepharose 6 MB for use in cell biology and immunology for 

35 the separation of "functionallyhomogeneouscell populations". It is produced by activation of Sepharose6MB 
macrobeads(diameter200-300 fim) with cyanogen bromide and is handled in a manner analogous to CNBr- 
activated Sepharose 4B. 

The peptide to be coupled is suspended in a volume of the cold buffer equal to the volume of the 
packed gel, added to the moist, washed gel, and then the suspension is immediately mixed (in a Buchner 
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funne!) with a glass stirring rod. The entire procedure of washing, adding the peptide solution, and mixing 
preferably consumes less than 90 seconds. The suspension is transferred from the Buchner funnel to a beaker 
containinga magnetic mixing bar and is gently stirred at 4°C Although the reaction is essentially complete 
in 2 to 3 hours, the mixture is allowed to stand at 4°C for 16 to 20 hours to insure complete loss of reactive 

5 polysaccharide groups. The peptide-linked gel is then washed with large volumes of water until it is 
established that peptide is no longer being removed. 

The quantity of peptide coupled to the polysaccharide gel can in part be controlled by the amount of 
peptide added to the activated matrix. When highly substituted polysaccharide gel derivatives are desired, the 
amount of peptide added should be 20 to 30 times greater than that which is desired in the final product. For 

10 ordinary procedures, 100 to 150 mg of cyanogen bromide are used per ml of packed polysaccharide gel, but 
much higher coupling yields can be obtained if this amount is increased to 250 to 300 mg. The pH at which 
the coupling reaction is performed also affects the degree of coupling, since it is only the unprotonated form 
of a peptide's amino groups that reacts with CNBr-activated polysaccharides. Preferably, the N-terminal ot- 
amino group of the peptide ligand is used for coupling with the activated polysaccharide matrix. a~amino 

15 groupswillcoupleoptimallyatapHofabout9.5to 10.0. If coupling at the e-amino group(s) of the selected 
peptidetligand(such as the e-amino groups of the lysinyl residues) is desired, the coupling reaction should be 
conducted at a pH value of about 10.0, and a large excess of peptide should be added. If coupling at the 
aromatic amino groups in the histidyl or tryptophanyl residues of the selected peptide is desired, very high 
coupling efficiency can be obtained at pH values between 8 and 9. 

20 Further details of the invention can be found in the following examples, which further define the 

scope of the invention. All references cited throughout the specification, and the references cited therein, are 
hereby Expressly incorporated herein by reference in their entirety. 
J EXAMPLES 
| EXAMPLE 1 

25 Experimen tal Section 

i Computation al Methods 

j All calculations were performed with the DISCOVER program (Biosym Technologies, San Diego) 
using the all-atom AMBER force field (Weiner, S. J.; Kollman, P. A.; Case, D. A.; Singh, U. C; Ghio, C; 
Alagona, G.; Profeta, S., Jr.; Weiner, P. J. Am. Chem. Soc. 1984, 106, 765-784; Weiner, S. J.; Kollman, P. A.; 
30 Nguyen, D. T.; Case, D. A.J. Comp. Chem. 1986, 7, 230-252) with a distance dependent dielectric constant 
(€=4rj! 

Synthesis ' ! 
i Materials and Methods 
Peptides were synthesized using standard solid phase synthesis techniques (Merrifield, R.B. J. Am. 
35 Chem. Soc. 1963, 85, 2 149-2 154; Kaiser, E.; Colescot.RX.; Bossinger,C.D.; Cook, P.l. Anal. Biochem. 1970, 
34, 59|5-598). Organic chemicals were purchased from Aldrich (Milwaukee Wl) or Fluka (Ronkonkoma, 
NY). Protected amino acids were purchased from Bachem CA (Torrance CA) or Peninsula Labs (Belmont 
CA). BOP (benzotriazole-l-yl-oxy-tris [dimethylamino] phosphonium hexafluorophosphate) was purchased 
from Richelieu Biotechnologies (Montreal). Solvents were purchased from Baxter (McGaw Park IL), Baker 
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(Phillipsburg NJ), or Mallinckrodt (Paris KY). Polystyrene supports were purchased from Advanced 

ChemTech (Louisville KY). 

Mono-t-butyloxycarbonyl (BOC) 1,3-propanediamine was prepared as follows. 2-(terf- 
butoxycarbonyloxyimino)-2-phenylacetonitrile (18 g, 73 mmol) was added portionwise over 10 minutes to 

5 a solution of l,3-diaminopropane(12.5 g, 184mmol)in lOOmLof tetrahydrofiirancooled to 0° C. Afterfour 
hours at 0° C the reaction was allowed to warm to 25° C for two hours. The reaction was diluted with 1 SO mL 
of ethyl acetate and washed twice with 1 00 mL of saturated aqueous sodium chloride. The organic phase was 
extracted with three 100 mL volumes of 10% aqueous citric acid, the combined aqueous portions were then 
washed twice with 100 mL of ethyl acetate. The aqueous phase was cooled in an ice bath and the pH was 

10 adjusted to approximately 13 with 50 % sodium hydroxide. The basic aqueous phase was then extracted with 
three 1 00 mL volumes of dichloromethane.The organic portion was then dried with potassium carbonate and 
filtered. Soivent was removed by rotary evaporation to yield mpno-Jert-butyloxycarbonyl-1,3- 
diaminopropane. 

Peptides were purified by reverse-phase HPLC on a Vydac C-18 column, eiuted with acetonitrile- 
1 5 water gradients containing 0.1% v/v trifluoroacetic acid (TFA), Peptides were characterized by electrospray 
MS on a PE SCIEX API 111+ tripie quadrupolemass spectrometer and by quantitative amino acid analysis on 
a Beckmann 6300 automated amino acid analyzer. Organic intermediates were analyzed by l H and 13 C 
nuclear magnetic resonance (1MMR) on a Varian VXR-300S and by high-resolution mass spectrometry (MS) 
on a JEOL JMS-HX1 10HF/HX1 10HF tandem mass spectrometer. 
20 AcTNE(OFm)DLAARRE(OAUyl)QQna-MBHA-polystyrene (la): 

Linearpeptide la with the sequence shown was synthesizedon p-MBHA resin (4.25 grams (g), 0.57 
milliequivalents/gram (meq/g), 2.42 millimoles (mmol)) using standard coupling cycles with three molar 
equivalents of BOC-amino acid, 3.3 molar equivalents of BOP and 3.3 molar equivalents of N- 
methylmorpholinein dichloromethane(CH 2 Cl 2 ) and dimethyl acetamide (DMA) if needed for solubility, for 
25 one hour at room temperature. The N-acetyl cap was attached by treatment with 5 milliliters (mL) of acetic 
anhydride in 3% triethylamine(TEA) in CH 2 C1 2 for 20 minutes at room temperature. The resin was dried and 
weighed (4.41 g, estimated at 0.22 meq/g). 

AcTNE(OFm)DLAARRE(OFm)QQnh-MBHA-polystyrene (If): 
AcAEE(OFm)AAAKFLE(OAllyl)AHAnh-MBHA-polystyrene(2a): 
30 Linear peptides If and 2a as shown were synthesized as described above for la. 

AcTNQ(Y-NHCH 3 )DLAARRQ(Y-NHCH 3 )QQnh 2 (lb): 

Linearresin-boundpeptide If (0.60g, 0.17 mmol) was doubly deprotected with 20% piperidine/DMA for 20 
minutes. The free carboxylic acids were coupled to methylamme(CH 3 NH 3 Cl, 0.26 g, 3.85 mmol) with BOP 
(1.57 g, 3.55 mmol) and N-methy]morpholine(0.90mL, 8.2 mmol) in CH 2 C1 2 /DMA for 1 .5 hours. The resin 
35 was washed and dried, and the peptide-resin bond was cleaved with anhydrous hydrofluoric acid (HF) ( I OmL) 
at 0°C for one hour with anisoie (1 mL) and ethylmethylsulfide (EtSMe) (0.5 mL) as scavengers. The resin 
was washed twice with ether, once with ethyl acetate, and again with ether. The free peptide was then 
extracted from the resin with sequential washes of 10% acetic acid, glacial acetic acid, acetomtrile, 1 0% acetic 
acid, and water. The combined solutions were lyophilized and the residue was purified. 
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_CH2 CTH 2 CI H 2- 



cydo-AcTNQ(Y-NH)DLAARRQ{Y-NH)QQnh 2 (lc): 
I .Using unprotected propanediamine: 

Linear peptide la on the resin (0.51 g.O.l 1 mmol) was deprotectedatthe fluorenylmethyl ester with 

5 20% piperidine/DMA for 20 minutes and the resulting piperidine salt was neutralized by washing twice with 
l%TFAinCH 2 Cl 2 . The free carboxyiic acid was coupled to l,3-propanediamine(0.l2mL, 1.44mmoI) with 
BOP (0.40 g, 0.90 mmol) and diisopropyiethylamine(DIPEA) (0.17 mL, 0.98 mmol) in CH 2 C\ 2 for one hour, 
followed by addition of DMA and continued coupling for an additional 45 minutes. The glutamic acid allyl 
ester was deprotectedwith tetrakis (triphenylphosphine)palIadium(0)(Pd(PPh3) 4 ) (0.2 1 g, 0. 18 mmol) in 20% 

1 0 piperidine/DMA for 1 .5 hours and the piperidine was removed by washing twice with 1 % TFA in CH 2 C1 2 . 
The resulting amino acid was cyclized with BOP (0.32 g, 0.72 mmol) and DIPEA (0.13 mL, 0.75 mmol) in 
CH 2 C1 2 for 3.5 hours. A Kaiser test gave a noticeable purple color, so the cyclization was repeated with BOP 
(0.44 g, 0.99 mmol) and DIPEA (0.19 mL, 1 .09 mmoi) in CH 2 CI 2 for three days. The peptide was cleaved 
from the resin as described above for lb. 

15 ?.. Using mono-BOC propanediamine: 

Linearpeptide la on the resin (0.57 g, 0.13 mmol) was deprotectedatthe fluorenylmethyl ester with 
20% piperidine/DMAfor 0.5 hour and the resulting piperidine salt was neutralized by washing twice with 1 
% TFA in CH 2 C1 2 . The free carboxyiic acid was coupled to mono-rer/-butyloxycarbonyl-l,3-propanediamine 
(0.23 g, 1.32 mmol) with BOP (0.52 g, U18 mmol) and DIPEA (0.2S mL, 1.44 mmol) in CH 2 C1 2 /DMA for 

20 one hour. The glutamic acid allyl ester was deprotected with Pd(PPh 3 ) 4 (0.21 g, 0.18 mmol) in 20% 
piperidine/DMA for 1.5 hours and the piperidine was removed by washing twice with 1% TFA in CH 2 CI 2 ; 
the Kaiser test was negative at this point. The BOC group was removed with TFA/CH 2 Cl 2 /anisole/l,2- 
ethanedithiol (45:45:5:5 vol/vol); the free amine then.gave a positive Kaiser test. The resulting amino acid 
was cyclized with BOP (0.58 g, 1,3 1 mmol) and DIPEA (0.30 mL, 1 .72 mmoi) in CH 2 C1 2 for two hours, 

25 whereupon the Kaiser test gave only a faint blue-green color. The peptide was cleaved from the resin as 
described above for lb. 

AcAEQ(Y-NHCH 3 )AAAKFLQ(Y-NHCH 3 )AHAnh 2 (2b): 

Linear peptide 2a on the resin (0.60 g, 0.19 mmol) was deprotected at both the ally! and the 
fluorenylmethyl esters with Pd(PPh 3 ) 4 (0.1 g, 0.09 mmol) in 20% piperidine/DMA for 30 minutes and the 
30 resulting piperidine salt was neutralized by washing with 50% TFA in CH 2 C1 2 containing anisole and 1,2- 
ethanedithiol. The free carboxyiic acids were coupled to methylamine (40% aqueous, 0.16 mL, 1.86 mmol) 
with BOP (0.32 g, 0.72 mmol) and DIPEA (0.35 mL, 2.01 mmol) in CH 2 C! 2 /DMA for 1 hour. The peptide 
was cleaved from die resin as described above for lb. 
Id, le, 2c, 2d, 2e: 

35 Id and le were prepared from la and 2c-2e were prepared from 2a by the same procedures as 

described above for lc using unprotected 1,3-propanediamme, coupling with 1,4-butanediamine for Id and 
2d and with 1,5-pentanediamine for le and 2e. 
AcTNk(S-Acm)DLAARRK(S-Acm)QQnh 2 (3a): 
AcAEk(S-Acm)AAAKFLK(S-Acm)AHAnh 2 (4a): 
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Linear peptides3a and4a weresynthesizedbystaridardMerrifieldtechniquesusingFMOCchemistry 
(Atherton,E.;Sheppard,R.C.J. Chem. Soc. Chem. Commun. 1985, 165-166). Fmoc-D-Thiolys(Acm)-OH 
(7, k(S-Acm)) and Fmoc-L-Thiolys(Acm)-OH (10, k(S-Acm)) were prepared as described below. 



AcTNk(S)DLAARRK(S)QQnh 2 (3b): 



AcAEk(S)AAAKFLK(S)AHAnh 2 (4b): 

Cyclic peptides 3b and 4b were prepared from 3a and 4a respectively by simultaneous deprotection 

and oxidation. Approximately 16 mg of Acm-protected peptide was dissolved in 1.5 mL of water containing 

1 0 1 0% acetic acid and then diluted to 50 mL total volume with trifluoroethanol to give a final concentration of 

approximately 200 micromoles/Iiter (uM). A total of 12 mL of a 6 miltimoles/liter (mM) solution of iodine 

(80 milligrams(mg) dissolved in 3 mL of acetic acid and diluted to 50 mL with trifluoroethanol) was added 

in 1 mL portions over the course of 10 hours while the reaction progress was monitored by HPLC. When the 

starting material had been consumed, the reaction was diluted with water and lyophilized and the crude 

15 oxidized material was purified. 

(2S3R)-2,5-dihydro-3,6-diethoxy-2-isopropyl-5-(4-bromobutyl)pyrazine(5): 

n-butyllithium(13.3 mLof a 1.6M solution in hexanes, 21,3 mmol) was added to a solution of (2S)- 
2,5-dihydro-3,6-diethoxy-2-isopropyl pyrazine (Schollkopf reagent)(4.30g, 20.3 mmoi)in tetrahydrofuran 
(THF) over the course of five minutes. The solution was maintainedat-78°C for 15minutes afterwhich 1,4- 
20 dibromobutane (9.75 mL, 81 .2 mmol) was added in a single portion. After 2.5 hours at -78°C the reaction 
was allowed to warm to room temperature and diluted with diethyl ether (100 mL). The organic phase was 
washed with water (100 mL), brine (100 mL) and then dried with magnesium sulfate (MgS0 4 ). Following 
filtration and concentration most of the residual 1 ,4-dibromobutane was removed under high vacuum. The 
remaining oil was purified by silica gel chromatography (2% ethyl acetate in hexanes) to provide 5 (3.7 g, 

25 57%)asacolor!essliquid;[a3 25 D -I-13 0 (c=4.5,CHC] 3 );IR(thinfilm)2957, 1689, 1456,1364.1304,1230, 
1144,1038 cm" 1 ; ^H NMR (300 MHz, CDClg) 8 4.04-4. 1 8 (m, 4H) 3.94-4.02 (m, IH) 3.90 (t, J=3.9, IH) 
3.39 (t, J-6.9, 2H) 2.21-2.32 (m, IH) 1.68-1.92 (m, 4H) 1.33-1.47 (m, 2H) 1.274 (t, J=7.2, 3H) 1.268 (t, 
J=7.2,3H) 1.03 (d,J=6.9,3H) 0.70 (d,J=6.9,3H); 1 3 CNMR( 100.6 MHz, CDCl 3 )d 163.12,163.05, 60.72, 
60.48, 55.09, 33.60, 33.1 1, 32.66, 31.80, 23.22, 19.03, 16.6, 14.34, 14.31; Mass Spectrum (FAB+) 347.1 

30 (MH+). 

(2S,5R)-2,5-dihydro-3,6-diethoxy-2-isopropyl-5-(4-4-methoxybenzyl)-thiobutyl) pyrazine (6): 

Potassium rew-butoxide (1 1.8 mL of a 1 M solution in THF) was added over five minutes to a 

solution of 4-methoxy-a-toluenethiol (1.85 mL of 90%, 1 1 .8 mmol) in THF (20 mL) at 25°C, generating a 

thick white precipitate which was stiired for 25 minutes. A solution of 5 (3 .7 g, 1 0.7 mmol) in THF (20 mL) 

35 was added and stirring continued for three hours. The reaction was concentrated by rotary evaporation and 

then partitioned between water (50 mL) and diethyl ether (100 mL). The organic portion was washed with 

brine, dried (MgSOA and concentrated. The residue was purified by silica gel chromatography ( I % increasing 

25 o 

to 2,5% ethyl acetate in hexanes) to provide 6 (2.90 g, 91%) as a colorless oil; [a] D -3.77° (c-3.5, 
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CHCI3); IR (thin film) 2957, 1689, 1510, 1238, 1038 cm" 1 ; 1 H NMR (300 MHz, CDCI3) 5 721 (d,J=8.7, 
2H) 6.84(d, J=8.4,2H)4.02^.20(m, 4H)3.93-3.99{m, lH)3.88(t, J=3J, 1H) 3.80 (s, 3H) 3.65 (s,2H) 239 
(t, J=7.5, 2H) 2.20-2.32 (m, ! H) 1 .64-1 .82 (m, 2H) 1.50-1 .62 (m, 2H> 1 .22-1 .38 (m, 2H) 1 .27 (t, J=7.2, 6H) 
1.03 (d, J=6.9, 3H) 0.70 (d, J=6.9, 3H); 13 CNMR(100.6 MHz CDCI3) 6 163.20, 163.00, 158.50, 130.57, 
5 129.80. 113.80, 60.67, 60.45, 60.40. 55.24, 55.20, 35.57, 33.68, 31.74, 31.14, 29.19, 23.92, 19.05, 16.60, 
14.36. 14.32; Mass Spectram (FAB+) 421.2 (MH+). 

(2R) 2-{9-Fluorenylmethoxycarboiiyl)ainino-6-acetaiiiidoinethyithiohexanoic acid (D-Thio(Acin)lysine) 
(7): 

Water (30 mL) and 3N HC1 (6.5 mL) was added to a solution of 6 (2.90 g, 9.32 mmol) in THF (50 
10 mL). Hie mixture was stirred at 25'C for 12 hours and then the THF was removed by rotary evaporation. 
The solution was adjustedto pH 10 with aqueous potassium carbonate (K 2 C0 3 ) and then extracted twice with 
ethyl acetate (75 mL). The organic extracts were dried (MgS0 4 ), concentrated and the residue was purified 
by silica gel chromatography (1:1 increasing to 2: 1 ethyl acetate/hexanes with 0.5% triethylamine) to yield 
crude S-methoxybenzylthiolysineethyl ester (2.65 g, 91%) as a colorless oil which was carried on directly to 

1 5 the next step. This ester (3.02 g, 9.7 1 mmol) was dissolved in a mixture of trifluoroacetic acid (50 mL) and 
anisole ( 1 mL) at 0 e C, and mercuric acetate (3.09 g, 9.71 mmol) was added to give a clear solution. After 15 
minutes the trifluoroacetieacid was ^ removed by" ^rotaiy evaporation and the residue was ^ 
(75 mL), then washed with diethyl ether (75 mL). The aqueous portion was treated with hydrogen sulfide 
(H 2 S) (bubbled through the solution) for 30 minutes and the resulting black precipitate was removed by 

20 filtrationthroughabedofcelite. The filtrate was concentrated by rotary evaporation, redissolved in water (20 
mL) and filtered through a 0,45 micrometer (urn) nylon filter. The solution was again concentrated and the 
residual foam was dried under high vacuum overnight The residue was dissolved in trifluoroacetic acid (15 
mL) and acetamidomemanol (Fluka, 0.95 g, 10.7 mmol) was added. After 45 minutes the reaction was 
concentratedby rotary evaporation and then dried under high vacuum overnight. The residue was dissolved 

25 in THF (20 mL) and cooled to 0°C. A solution of lithium hydroxide (1.22 g, 29.1 mmol) in water (20 mL) 
was added in two portions at 1 5 minute intervals. Aftertwo hours the reaction was allowed to warm to room 
temperature and the pH was adjusted to 7.0 with 1 mole/liter (M) aqueous citric acid. The solvent was 
removed by rotary evaporation, the residue was dissolved in dioxane (80 mL), and Fmoc N-hydroxy 
succinimidc(3.3 g, 9.71 mmol) was added followed by saturated aqueous sodium bicarbonate (NaHCOj) (15 

30 mL). Afteronehour.mesolventwasremovedbyrotaryevaporationandmeresidue 

... water (50 mL) and ethyl acetate (50 mL). The aqueous portion was adjustedto pH 2.5 with 1 M aqueous citric 
acid andmen extracted^ 

(MgSG 4 ) and concentrated. The residue was purified by silica gel chromatography (first with 2:1 ethyl 
acetate/hexanes with 0.5% acetic acid, then ethyl acetate with 0.5% acetic acid, then 5% methanol in ethyl 
35 acetate with 0.5% acetic acid), product containing fractions were concentrated from toluene (1 50 mL, three 
times) prior to dissolution in water with acetonitrile and iyophilization to provide 7 (3.16 g, 71% over four 
s tep S )asawhi te powder; [a] 25 D -18° (c=2, EtOH); IR (thin film) 2800-3400, 1709, 1536, 1260,759,740 
- 1 ; l H NMR (300 MHZ, DMSO-d6)6 8.42 (t, J=6.0, 1H) 7.88 (d, J=7.3,2H) 7.72 (d, J=7.5, 2H) 7.58 (d, 



cm 
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J=8.1, 1H) 7.407 (t, J=7.5, 2H) 7.32 (t, J-7.5. 2H) 4.16-4.30 (m, 5H) 3.91 (m, 1H) 2.53 (t, J=7.2, 2H) 1.82 
(s, 3H) 1.33-1.76 (m, 6H); 13 C NMR (100.6 MHZ, DMSO-d6) 6 174.00, 169.16, 156.08, 143.86, 143.80, 
140.71, 127.61, 127.05, 125.28, 120.08, 65.57, 53.90, 46.68, 30.54, 30.01, 28.79, 24.91, 22.54; High 
Resolution Mass Spectrum (FAB+) 457.1785', Entppm/mmu] -2.7/-1.2. 
5 (2R,5S>2,5-dihydro-3,6-dimethoxy-2-isopropyl-S-(4-bromobutaDe) pyrazine (8): 

[«] 25 D +|.73° (c=4.4,CHCl 3 );IR (thin film) 2959, 1696, 1458, 1434, 1237, 3 196, 1007 cm" 1 ; ^HNMR 

(300 MHZ, CDCI3) 5 3.98-4.07 (m, 1H) 3.95 (t, J=3.6, 1H) 3.70 (s, 3H) 3.68 (s, 3H)3.40 (t, J=6.9, 2H) 

2.19-2.32(m, 1H) 1.66-1 .94 (m,4H) 134-1 .48 (m,2H) 1.05 (d, J=6.9, 3H) 0.69 (d, J=6.9, 3H); 13 C NMR 

( 100.6 MHZ, CDCI3) 8 163.62, 163.60,60.77, 55.1 1, 52.31,33.54,33.10, 32.61, 31.73, 23.26, 19.02, 16.56; 

10 Mass Spectrum (FAB+) 319.1 (MH+). 

(2R,5S>2,5-dihydro-3,6-dimethoxy-2-isopropyl-5-(4-(4-metlioxybeiizyl)-thlobutyl) pyrazine (9): 

[a] 25 D +3.96° (c=3.63,CHCl 3 ); IR (thin film)2944, 1696, 1 510, 1238 cm" 1 ; 1 H NMR (300 MHZ, CDCI3) 
6 7.2 1 (d, J=8.7, 2H) 6.83 (d, J=8.4, 2H) 3.97-4.25 (m, 1H) 3.93 (t, J=3.3, 1H) 3.79 (s, 3H) 3.68 (s, 3H) 3.67 
(s, 3H) 3.65 (s, 2H) 2.39 (t, J-7.5, 2H) 2. 18-2.30 (m, 1H) 1 .60-1.84 (m, 2H) 1 .49-1 .62 (m, 2H) 1.23-1 .38(m, 
15 2H) 1.27 (t, J=7.2, 6H) 1.04 (d, J=6.9, 3H) 0.68 (d, J=6.9, 3H); 13 C NMR (100.6 MHZ, CDCI3) 6 163.72, 
163.47, 158.48, 130.55, 129.78, 1 13.78, 60.68, 55.22, 55.18, 52.27, 35.58, 33.63, 3 1 .64, 31.12, 29.13, 23.88, 
19.02, 16.51; Mass Spectrum (FAB+) 393.2 (MH+). 

(2S)2-(9-Fluorenylmethoxycarbonyl)ainino-6-acetamidoniethylthiohexanoi«acid: (L-Thio(Acm)lysine) 
(10): 

11 

20 [a} 25 D +1.3° (c=2, EtOH); IR (thin film) 2800-3400, 1709, 1536, 1260, 759, 740 cm" ; HNMR (300 
MHZ, DMSO-d6) 8 8.42 (t, J=6.0, 1H) 7.88 (d, J=7.3, 2H) 7.72 (d, J=7.5, 2H) 7.58 (d, J-^8.1, 1H) 7.407 (t, 
J=7.5, 2H) 7.32 (t, J=7.5,2H) 4.1 6-4.30 (m, 5H)3.91 (m, 1H) 2.53 (t, J=7.2, 2H) 1.82 (s,3H) 1.33-1.76(m, 
6H); 13 CNMR(100.6MHZ,DMSO-d6)6 174.00, 169.16, 156.08, 143.86, 143.80, 140.71, 127.61, 127.05, 
125.28, 120.08, 65.57, 53.90, 46.68, 30.54, 30.01, 28.79, 24.91, 22.54; High Resolution Mass Spectrum 

25 (FAB+) 457.1776, Err[ppm/mmu] -4.6/-2.1. 

These materials were prepared in the same manner as 5, 6, and 7 starting from (2R>2,5-dihydro-3 ,6- 
dimethoxy-2-isopropyl pyrazine (Merck). 
NMR Spectroscopy 

For each peptide, 2-4 mg of purified material was dissolved in 440 microliters (ul) of 25 mM d r 
30 sodium acetate containing 5% deuterium oxide (D 2 0) and 0. 1 milumoles/Uter (mM) sodium azide yielding 
a total peptide concentration of 1-6 mM; the pH was adjusted to 4.5 by microliter additions of 1 M sodium 
hydroxide (NaOH). All spectra were acquired at 5°C or 10°C on a Bruker AMX-500 spectrometer. Two 
dimensional COSY (Aue, W. P., Bartholdi,E. & Emst, R. R. J. Chem. Phys. 1976, 64, 2229-2246), ROESY 
(Bothner-By, A. A., Stephens.R. L., Lee, J.-m., Warren, C. D. & Jeanloz, R. W. J. Am. Chem. Soc. 1984, 106, 
35 81 l-S13;Rance, U.J. Magn. Reson. 1987, 74, 557-564) and TOCSY (Braunschweiler, L. & Emst, R. R. J. 
Magn. Reson. 1983, 53, 521-528; Bax, A. & Davis, D. G. J. Magn. Reson. 1985, 65, 355-360) spectra were 
acquiredwith phase discrimination in o>, achieved with TPP1 (Marion, D. & Wuthrich, K. Biochem. Biophys. 

-124- 



BNSOOCID: <WO 9820036A1_L> 



WO 98/20036 



PCT/US97/20069 



Res. Commun. 1983, 113, 967-974). Total acquisition times were approximately 2, 4, and 12 hours for 
COSY. TOCSY and ROESY spectra, respectively. Water suppression was achieved by coherent low power 
irradiation of the water resonance for the 1.5 second (s) recycle delay. ROESY and TOCSY spectra were 
acquired as described by Akke, ML, Skelton,- N. J., KordeL J. & Chazin, W. J. In Techniques in Protein 

5 Chemistry II; Vitlafranca, J. J., Ed.; Academic Press, Inc.: Boca Raton, FL, 1991; pp. 401^08; in addition, 
first-orderphase corrections were avoided by acquisition in a sine-modulated fashion in g>j . TOCSY mixing 
was achieved with a clean DIPSI-2rc sequence applied for 90 milliseconds (ms) (Cavanagh, J. & Ranee, M. 
J. Magn. Resort. 1992, 96, 670-678). The ROESY spectra were collected with a 4.0 kilohertz(kHz) spin-lock 
pulse of 200 ms duration. The spectra were processed and analyzed using the Felix software package (Biosym 

10 Technologies,San Diego, CA). 3 JHN-Ha were obtained by fitting an antiphase pair of Lorentzian lines to o> 2 
slices of high digital resolution COSY spectra. 

Amide proton exchange rates with solvent were measured for 1 b and 1 c by lyophilizing the peptide 
- from H 2 0 and acquiring a series of one dimensional (ID) NMR spectra immediately after dissolving the 
peptide in DjO. Exchange rate constants were determined by performing a three parameter exponential fit 

15 to the decaying amide signals. Protection factors were calculated as the ratio of exchange rate in the cross- 
linked and uncross-linked peptide. 

Structure Ca lculation •' 

NOESY (Kumar, A., Ernst, R. R. & Wilthrich, K. Biochem. Biophys. Res. Commun. 1980, 95, 1-6; 
Bodenhausen, G., Kogler, H. & Ernst, R. R. J. Magn. Reson. 1984, 58, 370-388) and ROESY data were 

20 collectedwith mixing times of 300 ms and 200 ms, respectively, from water (H 2 0) and D 2 0 using a sample 
of lc (approximately 8 mM). Total acquisition times were 24 hours per experiment. Distance restraints were 
generated from these data by categorizingcross-peaksas strong, medium, weak or very weak according to the 
integratedpeak volume, and assigning an upper bound of 2.9, 3.5, 4.6, or 5.6 angstroms (A), respectively, to 
the correspon ding interproton distance. The dihedral angle 4> was restrained between -90 ' and -40° for 

25 residues in which 3 J HN . H « less than 6.0 hertz (Hz). Values of 3 J H «-Hp were determined from a COSY 
spectrum acquired in D 2 0 solution with a 35' mixing pulse. Xi restraints and Hp stereospecific assignments 
were obtained for four side-chains on the basis of these coupling constants and the results of initial structure 
calculations(Skelton,N J Garcia, K C , Goeddet D V., Quan, C. &Bumier, J. P. Biochemistry 1994, 33, 
13581-13592). 

30 Structures were calculated using the program DGII using the CVFF force field parameters (Biosym 

Technologies, San Diego, CA). Input restraints consisted of 141 interproton distances, 9 c{> dihedral angle 
restraints and 5 X , dihedral angle restraints. Explicit hydrogen bonds were hot included. Structures were 
generated with triangle and tetrangle smoothing prior to perspective embedding of all atoms. The embedded 
structures were annealed for 10,000 steps in four-dimensional space while cooling from 200 degrees keivin 

35 (K.) with all atom massesset to 1000. The DG structures were refined by rMD using the AMBER force field 
within the DISCOVER program (Biosym Technologies). Structures were annealed at 600 K for 3 picoseconds 
(ps), cooled to 0 K over 1.8 ps and finally subjected to 200 cycles of rEM. Charges on Glu. Asp, Arg, C- 
terminal and N-tenninal residues were reduced to 0.2 e and a distance dependent dielectric pf l/4r was 
employed. Restraints were employed as square well potentials with force constraints of 25 
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kilocalories/mole/angstrorn 2 (kcal-moi* 1 A' 2 ) and 1 00 kilocalories/moie/radian 2 (kcai-mol-W 2 ) for distances 
and dihedral angles, respectively- In the final round of calculation, 60 structures were embedded in DGI1 and 
refined by rMD. 

Circular Dichroism * 
5 CD spectra were acquired on an Aviv 62 spectrometer with a 0.1 centimeter (cm) path length 

temperature-controlled cell. Solutions for analytical spectra were prepared by dilution of NMR samples to 
approximately 100 micromoles/liter (uM) with additional NMR buffer. Points were taken every 0.2 
nanometers (nm) with 0.2 nm bandwidth and 2 seconds (s) averaging time. The shortest wavelength attainable 
was limited by absorption of the acetate buffer. Curves shown are smoothed with standard parameters (10- 
10 point smoothing)- 

RRSTTTTS AND HTSCUSSION 

Design Considerations 

Given synthetic and geometric considerations, it was determined that amide chemistry should be used 
to link the / and 1+7 side-chains. Disulfide bonds, while synthetically feasible, introduced an unwanted 90° 

15 twist into the linkage. In orderto exploit the ability of simple alky! chain linkers to avoid steric crowding in 
the region near the 1+3 and 1+4 residues, linkage methods for bridging either Gin or Asn at I and /+ 7 with 
an alkanediyl chain were considered. Gin was chosen because its greater length allows use of the minimum 
size tether to link these side-chains. A representative set of protein crystal structures from the Brookhaven 
Database (Bernstein, F. C; Koetzle, T. R; Williams, G. J. B-; Meyer, E. F.; Brice, M. D,; Rodgers, J, R.; 

20 Kennard, O.; Shimanouchi.T.; Tasumi, M. J, MoL BioL 1977, / 12, 535-542) was searched for all occurrences 
of glutamine in an a helical context (with <j> = -60* ± 30° and * = ^5° ± 30°). The resulting data set was 
used to determine the side-chain rotamer distributions of naturally occurring helical glutamine residues. In 
general amino acid residues in an a helical context have %j = -60°, a conformation suitable for the 1+7 
position of a side chain linker. Glutamine has a relatively high population (14, 6%) of the x\ = 180° rotamer, 

25 representing a significant natural conformation that points the side chain towards the C terminal end of the 
helix. Rotamer combinations were identified that minimized the Ne2 - Ne2 distance between the / and /+ 7 
side-chains in a model helical peptide. Depending on % 3 values, distances ranging from 53A to 7A were 
foundifthe/ gtutammeassumesx ia nd x 2 anglesof 180° and 60° and the 1+7 glutamine assumes x\ and 

X2 values of -60° and 180°, respectively. 

30 Model building indicated that a 4^methylene "bridge" could efficiently link these two glutamine side^ 

chains without incurring unfavorable torsional interactions. Models of 3-, 4-, and 5-methylene-bridgedhetical 
peptides were constructedusing distance geometry methods {Quantum Chemistry Program Exchange, Program 
#590, entitled DGEOM by Blaney etal) followed by energy minimization. All residues except the linked 
glutamines were alanine. The conformationalstabilities of helical peptides were assessed using 1 nanosecond 

35 (ns> of unconstrained molecular dynamics at 298 K following an initial 100 picoseconds (ps) equilibration 
period during which harmonic restraints (25 kilocalories/mole/angstrom (kcal-mol A )) were applied to 
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15 



maintain helicity. As a control, a poiyalanine helix was calculated for I ns in the presence of identical 
restraints. 

Peptides containing a 3-methylene bridge maintained a consistent helical conformation but showed 
significant "bending" of the helix axis. Peptides containinga 4-methy lene bridge maintained helicity with little 
distortion, having comparable backbone dihedral angles to the control peptide; x\ md *2 of the 

tethered glutamines did not change during the simulation. Peptides based on a 5-methylene bridge transiently 
escaped out of a helical conformation into nested turns centered around the 1+5 residue. Multiple side-chain 
rotamers were also observed in the /+ 7 residue. Based on these observations, it was determined that the 4- 
methylene bridge would provide the preferred tether length. 
Synthesis and Characterization 

Amino acid sequences for trial peptides were based on the C-terminal helix of apamin (Habermann, 
E. and Reiz, K.G.,Biochem. Z 1965 t 343, I 92-203 ;Callewaert > G.L., Shipoiini, R., and Vernon, CA, FEBS 
Lett. 1968, U 1 1 1-1 13; Shipoiini, R., Bradbury, A.F., Callewaert, G.L., and Vernon, cX Chem. Commun 
1967, 679-680) and on S peptide derived from the C-peptide from RNAse A (Brown, I.E.; Klee, WA. 
Biochemistry 1971, 10, 470-476), The sequences of these peptides are shown in Table I below. 

Table 1 . Structures of peptides 1-4 



Peptide 


Sequence 


Side Chain Protection 


1 


Ac T.N X D L A A RRZQ Q NH 2 .._ 


a: protected, on resin, X=Glu(OAli), 
Z=GIu(OFm) 






b: X=Z=Gln(NMe) 


2 • .- 


Ac A E X A A A KFLZ AHA NH 2 


c: X-Z=Gln(N(CH 2 ) 3 N)Gln j 






d: X-Z=Gin(N(CH 2 ) 4 N)Gln 






e: X-Z=Gln(N(CH 2 ) 5 N)Gln 






f: protected, on resin, X=Z=Glu(OFm) 








3 


AcTNXD LAARRZQQNH 2 


a: X-D-Thioiys(Acm), Z-L-Thiolys(Acm) 






b: X-Z=D-Thioiys-S-S'L-Thioiys 


4 


Ac A EX A A A KFLZ AH ANH 2 





20 



25 



Linear protected peptides la, If, and 2a were synthesized by standard Merrifield methods using t- 
butyloxycarbonyl (BOC) chemistry. Control peptides lb and 2b were elaborated from If and 2a by 
simultaneous deprotection of both glutamate residues followed by coupling with methylamine (Figure 1). 
Synthesis of Id from If by double deprotection and coupling with 1,4-butanediamine was achieved in low 
yield/purity. Constrained peptides Ic-hj and 2c-e were elaborated from la and 2a by -removal of the 
fluorenylmethylester from Glu3, coupling with the appropriate alkanediamine, removal of the allyl ester from 
GlulO, and cyclization{Figure I). Yields were improved by the use of mono-BOC protected alkanediamine 
in the first coupling step and by the use of a polystyrene resin with 2% divinylbenzene (DVB) crosslinker. 
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The completed peptides were cleaved from the resin with hydrofluoric acid (HF) and purified by preparative 
high performance liquid chromatography (HPLC). Installation of the tether on the solid phase allowed the 
completion of the synthesis with only a single purification. 

Thioiysine based peptides 3a and 4a were synthesized in the linear acetamidomethyl-protected form 
5 using standard Merrifield methods and FMOC chemistry, followed by cleavage from the resin with 
trifluoroaceticacid/triethylsiIane(9:I v/v) and purification by preparative HPLC. These were converted into 
the disulfide forms 3b and 4b in solution by simultaneous deprotection and oxidation with acetic acid and 
molecular iodine in trifluoroethanol. 

Peptides 1-4 were characterized by mass spectrometry and by quantitative amino acid analysis. AH 
1 0 peptides gave results consistent with the intended structures. 

Protected D- (7) and L-Thiolysine<l 0) were prepared as shown in Figure 2. The Schollkopf reagent 
(SchBllkpf, U.; Groth, U.; Deng, C. Angew.Chem. Int. Ed. Engl. 1981, 20, 798-799) was treated with 
n-butyllithiumfollowedby 1,4-dibromobutaneto give the known bromobutyl pyrazine 5. The bromide was 
displaced with the potassium salt of methoxytoluenethiol to give 6. The pyrazine was hydrolyzed with 
1 5 aqueous hydrochloric acid (HC1) and the thiol was deprotected with mercuric acetate (HgCOAch) in TFA 
followed by H 2 S. The crude thioiysine ethyl ester was then reprotected with acetamidomethanolin TFA. The 
ester was hydrolyzedwith lithium hydroxide (LiOH) and the free S-protected amino acid was N-protected with 
Fmoc N-hydroxysuccinimide in dioxane to give 7. The same procedures were used for the synthesis of 10. 
Proton NMR 

20 Peptides 1-4 were studied by 2D 1 H NMR. Resonance positions were obtained by standard 

sequential assignment methods (Wuthrich, K. (1986) NMR of Proteins and Nucleic Acids., Wiley, New 
York), and are listed in Table 2 below. 
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Table 2. Chemical Shifts 2 (backbone coupling constants'*) of the apamin-sequence peptides 1 and 3 



Residue 



lb lc Id le 3a 3b 



15 



2.02 2.01 2.02 2.02 2.02 2.02 



Acetyl 
CH 3 
Thrl 

H N 8.34 8.47 8.46 8.46 8.33 8.34 

H a ( 3 JHN-H«) 4 ' 22 < 7 - 2 > 4 ' 22 < 6 - 9) 4 " 21<6 - 9) 4,22 <6 ' 9) 4 ' 22 < 7 - 5) 4 - 22 

H p 4.17 4.30 4.28 4.30 4.15 4.16 

HY 1.13 1.18 1.17 1-19 .1.12 1-14 



10... Asn2 



jjN 8.69 8.78 8.78 8.79 8.55 8.6 

H^JHN-Ha) 4.54(6.6) 4.47(5.0) 4.49(5.4) 4.47(5.0) 4.60(6.9) 4.54 

(6.3) 

H pj 2:75* 2.77* 2.77* 2.78* 2.73* 2.73* 

i j 
r.ln^i m-Thiolvsme) 

H H 8i49 8.53 8.56 8.46 8.28 8.42 

H a ( 3 JHN-H*) 4.12(6.5) 4.11(4.9) 3.93(4.7) 4.03(4.9) 4.11(6.5) 4.01 

H (J 2101,1.87 2.11,1.75 2.11,1.81 2.09,1.83 1.72,1.64 1.84* 

Hi! ±22* 2.42,2.21 2.33,2.26 2.33,2.24 1.32* 1.29* 



-5 „;„ n .a. n.a. n.a. 1.52* 1-63, 



1.47 



.20 h| 77? 8.01 



4?? SOi ??? 7.99 2.52* 2.53, 

2.59 



H ii 8.32 8.05 8.09 8.03 8.29 8.07 

H«( 3 Jhn-hJ 4.46(6.6) 4.31(4.8) 4.32(4.7) 4.32(4.8) 4.50(6.7) 437 

-j : .... 

Ilk ■ ±65* 2.77,2.61 2.76,2.60 2.76,2.61 2.71,2.60 2.67* 
25 Leu5 

H N 8.20 7.92 7.93 8.03 8.24 8.15 

Hf( 3 JHN. H «) 4- 05 < 5 - 7 > 3 ' 95 < 5 - 3) 397(5 - 1) 394(4 - 8> 4 -° 7(5 - 5) 4 { 5 6 S) 

H P 1.60* 1.69,1.50 1.67,1.50 1.66,1.50 1.61,1.51 1.64, 

HY 1.50 1.61 _K60 K59 LSS 1-61 
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10 



15 



20 



25 



H° 

Ala6 
H N 

H^ 3 JfflMfa> 

hp 

Ala7 
H N 

H a ( 3 JHN-H«) 

HP 

Arg8 
H N 

H a ( 3 JHN.H«> 

hp 

H* 

H e 

Arg9 
H N 

H a ( 3 JHN-H*) 
HP 

H 5 
H e 

fihiinfL-ThiolvsineV 
H a ( 3 JHN-Hcc> 

HP 



0.84,0,76 0.83,0,79 0.83,0.79 0,83,0.79 

8.14 8.01 7.93 8.02 

4.07(5.1) 3.96(4.6) 3.97(nd) 3.96(5.0) 



1,33 



L34 



L77* 



3.11* 

999 



1.75* 



3.11* . 

??? 

8.30 

4.14(6.3) 



1,37 



1.33 



1.46 



1.47 



1.86* 



1.85* 



1,87* 



1.83* 



3.13* 
7.24 

7.92 

4.02(5.4) 



3.12* 

7,23 



2.03,1.95 2.11" 



2,08* 



1.36 



7.94 8.67 8.59 8.35 

4.13(5.4) 3,77(4.4) 3.94(4.8) 3.87(4,5) 



1.46 



8.01 7.65 7.91 7.82 

4.14(6.3) 4,01(4.9) 4.03(4,6) 4,02 (4.9) 



1.86* 



1.64,1.56 1.76,1.60 1.73,1.59 1.77, 1,59 



3.15,3,08 3.14,3.05 3.15,3.06 
7.21 7.23 7.23 



8.17 7.78 7.89 7.94 

4.17(6.1) 4,10(5.5) 4.11(5.2) 4,08(5.1) 



1,83* 



1.61,1.54 1.71,1.56 1.71,1-57 1.72,1.56 



3,11* 
7.23 



7.71 7.73 
4.17(6.3) 4.07(5.6) 



0.85,0.77 0.85, 
0.79 



8.18 



8.12 



4.07(5.2) 4.11 
(5.8) 



1.33 



8.00 



1.38 



8.21 



4.11(5.2) 4.09 
(5-5) 



1.33 



8.08 



1.40 



7.89 



4.14(6.3) 4.17 
(6.1) 



1.75 s * 



1,83, 
1.75 



1.62, 1.55 L54, 
1.49 



3.11* 
??? 

8.19 



3.10* 

??? 

7.97 



4.18(6.3) 4.21 
(6.6) 



1.74* 



1.85, 
1.77 



1.61, 1.54 1.64, 
1.56 



3.11* 

??? 

8.18 



3.11* 

99? 

8.09 



4.14(6.4) 4.16 
(6.3) 



2.12,2.06 1.71* 



1,75, 
L65 



-130- 



_9820O38A 1 I > 



WO9S/20036 



PCT/US97/20069 • 





2.26* 


2.47,2.41 


2.49,2.36 


2,40, 2.27 


1.34* 


1.32* 


H 6 


n,a. 


n.a. 


n.a. 


n.a. 


1.64* 


1.54, 
1.49 


H e 


??? 


7.71 


??? 


.7.85 


2.53* , 


??? 


Gin 11 














5 H N 


8.36 


7.79 


7.S2 


7.87 


8,39 


8.19 


Ha < 3j HN-Hcc) 


4.18(6.9) 


4.15(6,1) 


4.17(6,3) 




A 1 Q (ft fC\ 


A OC\ 

(6.6) 


hp 


2.04,1.95 


2.13,2.04 


2.09,2.04 


2.12,2.05 


2.04, 1.95 


2.05, 
1.96 


n 


2.32* 


2.44,2.38 


2.38* 


2.39* 


2.32* 


2.31* 


GM2 








• 






10 H N 


8.40 


7,97 . 


8.11 


7.87 


8.34 


8.31 


HCC ( 3j HN-Hcc) 


4.16(7,0) 


4.15(6,7) 


4.17(6.7) 


4,15(6.7) 


4.18(7.0) 


4. IS 
(6.9) 


hp 


2.04,1.93 


2.08 t L99 


2.07,1.96 


2.08, 1.98 


2.05, 1.94 


2.06, 
1.93 




2.32* 


2.41,2.37 


2.36* 


2.39, 2.35 


2.31* 


2.32* 



a Chemical shifts obtained at pH 4.5 and 5 °C. Shifts are relative to the internal H 2 0 resonance at 4.96 
15 parts per million (p.p.m.), and are accurate to ± 0.02 p.p.m, 

b 3 ^HN-Ha m listecJ m par entneses m units of nertz ( Hz )* 

Representative TOCSY and ROESY spectra of a diamide-constrained peptide (lc) are shown in 

Figures 10 and 3. A summary of the H N -H N and H tt -H N ROEs between neighboring residues used to make 

these assignrnentsare depicted in Figure 4 for lb and lc. The degree of helicity of each peptide was judged 
20 from these spectra by the presence : of intense sequential H N -H N ROE cross-peaks, the presence of/, 1+3 

H a -H N or H a -H0 ROE cross-peaks and ^HN-Ha iess than 6 *° Hz * 1116 data sumnwized m Figure 4 

3 

indicate that peptide lc is helical between residues Asn2 and Glnl 0. Beyond GlnlO, ^HN-Ha rises above 
6.0 Hz but some medium range ROEs are still present, indicating partial or transient helical character. Such 
fraying at helix termini is commonly observed in NMR studies of peptides and proteins. The H chemical 

25 shifts of lc change by less than 0.02 ppm over the concentration range 8.0-0.06 mM; this indicated that the 
helical conformation was not stabilized by a self-association event. 

The incorporation of the diamide cross-link in peptides 1 and 2 dearly reduced the mean value of 
3j HN-Ha in the restrained region, increased the number of observable (7, 7+J) ROEs and increased the 
percent helicity observed by circular dichroism (CD) as shown in Table 3 below. Thus, peptides !c-le and 

30 2c-2e were significantly more helical than the control peptides 1 b and 2b. The results for peptide 4 indicated 
that formation of the disulfide bond constrained the peptide to be helical. However, a number of medium- 
range ROEs could not be observed and 3 ^HN-Ha vaiues were greater than 6,0 Hz for the wo thiolysine 
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residues and Leu9 in 4b; this indicated a distortion from an ideal helical structure in the region of the D- 
thioiysine residue, as expected from simple structural considerations. The data in Table 3 below indicated that 
incorporation of the disulfide bond in peptide 3b did not impart helical character, suggesting that the thioiysine 
method may have a dependence on primary sequence and is therefore not generally applicable. 
5 Table 3. Evaluation of peptide helicity. 



Peptide 


Description 


Mean 3 J NH . Na in 
constrained region 


Fraction of 
medium-range 
ROEs obs. 


Percent 
Helicity by 
CD 


lb 


Apamin, N-rnethyl Gin Control 


6,00 


0.14 


20 


lc 


Apamin, 3 carbon linker 


4.98 


0.69 


84 


Id 


Apamin, 4 carbon linker 


5.18 


0.56 


63 


le 


Apamin, 5 carbon linker 


4.96 


0.69 


100 


2b 


C-tide, N-methyl Gin Control 


5.89 


0.08 


32 


2c 


C-tide, 3 carbon linker 


4.81 


0.75 


60 


2d 


C-tide, 4 carbon linker 


4.83 


0.43 


82 


2e 


C-tide, 5 carbon linker 


4.90 


0.80 


63 


3a 


Apamin, S-Acm thioiys control 


6.01 


0.03 


10 


3b 


Apamin, thioiys disulfide 


5.96 


0.08 


35 


4a 


C-tide S-Acm thioiys control 


5.96 


0.05 


19 


4b 


C-tide, thioiys disulfide 


5.65 


0.48 


27 



Values below 6 for the mean 3-bondNH-Na coupling constant indicate helicity. Medium range 7-/4-3 ROEs 
20 are expressed as the observed fraction of the total number of such ROEs possible, with very weak ROEs 
counted as one half. Percent helicity as determined by CD is derived as by Lyu et al.» Sherman, J.C.; Chen, 
A.; KailenbachjN.R,, Proc* Natl Acad. ScL U.S.A. 88:5317-5320 (1991), andJohnson, W.C.; Tinoco Jr., I., 
J. Am Chern, Soc, 94: 4389-4390 (1972) 

Peptide lc was chosen for a more detailed analysis by NMR. ROESY spectra with higher sensitivity 
25 (increased total acquisition time and peptide concentration) and NOES Y spectra were acquired and analyzed 
to provide input restraints to structure calculations. In addition to the ROEs described above, H -H (/, 1+4) 
interactions were observed, indicating that the helical conformation adopted is not of the 3|q type, but rather 
is of the regular a helical variety (Wuthrich, K. (1986) NMR of Proteins and Nucleic Acids., Wiley, New 
York). Interprotondistance restraints were generated from the ROESY and NOESY data, and used as a basis 
30 for calculating a structure for lc using distance geometry (DG) and restrained molecular dynamics (rMD). 
Nearly half (66) of the 141 restraints were between amino acids two to four residues apart in the primary 
sequencers expected for a helical conformation. Dihedral angle restraints, based on observed ^HN-Ha 
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and 3 J Ha _ H p, were also used in these calculations, but explicit hydrogen bond restraints were not utilized. 

The final ensemble of 20 structures is depicted in Figure 5. The structures agreed with the input data 
very well, with no distance restraint violations above 0.1 angstroms (A), no dihedral angle violations above 
IX^andameanrestrauit™^^ >■ ™ e available 

5 NMR data define well the backbone atoms of residues Thrl to GinlO (average root mean squared deviation 
from the mean structure = 038 ± 0.08 A), but the two C-terminai glutamine residues are not well defined. The 
side chainsof Thrl , Gui3, Asp4,Leu6 and GlnlO have well defmed Xl values, but only GlnlOhas a consistent 

value of X2 in ali structures. 

H N (/) - O(/-0 hydrogen bonds are were to the amide protons of Leu5, Ala6, and GlnlO in greater 
10 than 90% of the structures, indicating that these residues adopted a predominantly ^helical conformation. 
Although (4M) hydrogen bonds were observed to the amide protons of Ala7,Arg8 and Arg9 in approximately 
50% of the structures, H N (/) - 0(W) hydrogen bonds were present in 25-35% of the structures, indicating 
that there was a slight distortion of the helix in this region. The data presented in Table 4 below indicated that 
the amide hydrogens of Leu5 to GlnlO were all protected from exchange with solvent in peptide lc compared 
15 to the control peptide lb by factors of up to 25. This observation is also consistent with the amide hydrogens 
of these residues participating in hydrogen bonds. Interestingly, hydrogen bonds from Asp4 H to Thrl 0 Y 
were present in 80% of the structures, indicating that an N-cap hydrogen bonding interaction (Harper, E.T.; 
Rose, G.D. Biochemistry 1993, 32:7605-7609) was present even in. this short peptide. However, the amide 
proton of Asp4 was not noticeably protected^ hence this hydrogen bond may be more 

20 transient. . :: --.--r. - 

Table 4. Amide hydrogen exchange rates constants a and protection factors b for peptide lb and lc 



25 



30 



Residue 



Thrl - 

Asn2 

Gln3 

Asp4 

Leu5 . 

Ala6 " 

Ala7 

Arg8 

Arg9 

GlnlO 

Gin I t 

Glnl2 



log k (lb) 



-2.44 

n.d. 

n.d. 

-2.72 

-2.69 

-2.73 

-2.51 

n.d. 

n.d. 

n.d. 

n.d. 

n.d. 



logA(lc) 



-2.48 

n.d. 

n.d. 

-2.84 

-3.51 

-3.77 

-3.55 

-3.33 

-3.21 



-3.29 



-1.83 

n.d. 



Protection Factor 



approx. 1 



I. 3 
6.7 
10.9 

II. 1 
>26 
>20 



>25 



approx. 1.0 
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» The rate constants are expressed in units of seconds' 1 (s' 1 ). n.d. indicates that the exchange was sufficiently 

fastthatnopeakwasobservedm.tte ofD 2°- In these 

cases, assumingthat greater than 90%of the hydrogens have exchanged in 300sallo W sa lower limit of 0.013 

s" 1 (log k = -1 : 89) to be calculated for the rate constant. 

b Protection factors are calculated as the rate constant for peptide lc divided by that of peptide lb. 

With the exceptionof the f angles of Ala6andGlnl0, the backbone dihedral angles throughout the 
tethered region were close to those expected for an ideal a helix (mean d> = -63°±2\ mean * = -42"±8°) 
indicatingthatanydeviationfromidealitywasveryslight. The i|f of Ala6 is was 15° lower than expected for 
an a helix and was more similar to that expected for a 3 10 helix; the higher value of iji for GlnlO reflected 
the fraying beyond the tethered region. The slight distortion at Ala6 could be the result of the short tether 
present in this peptide (only three methylene groups). Although the diamide linkage was not well defined by 
the NMR data, the side-chains of Gln3 and GlnlO adopted conformations close to those predicted by the 
modeling experiments described above (Gln3 *l = -173'±17", % 2 = 34°±47°; GlnlO X \ — 71°±7% * 2 = 
174°±22°). Theoverall conclusion was that in solution.lc adopted an a helical structure from Asp2 to GlnlO 
15 with an N-terminal capping box and a very slight distortion in the central turn of the helix. 
Circular Dichroism 

CD spectra were acquired on aqueous solutions of 1-4 between 20 and 120 micromoles/liter (uM) 
at 280K, pH 5. Spectra of peptides 1 and 3 (apamin sequence) are shown in Figure 8 and those of peptides 
2 and 4 (C-peptide sequence) in Figure 8. Numerical values for percent helicity, calculated from the per- 

20 residue molar ellipticity of the peptides at 222 nanometers (nm), are shown above in Table 3 . 

The CD data supported the conclusions derived from the NMR studies. Both tethering methods 
substantiallyenhancedthe helicity of the C-peptide sequence (Figure 8). However, only the diamide method 
was capable of rendering the apamin sequence helical under the conditions used; the thiolysine-constrained 
peptide 3b did not appear to be helical (Figure 7). The CD spectrum of 4b, in spite of substantial negative 

25 ellipticity at 222 nm, showed several features which indicated a lesserdegree of helicity than those of 2c-2e: 
the short-wavelengthminimum in 4b was shifted from 208 nm (a typical value for an a helix) to 204 nm, and 
the observable shoulder of the 190 nm maximum was much smaller than those of 2c-2e. 

Thermal denaturationexperimentswereperformedon the apamin-basedpeptides Ib-le. In the initial 
experiment, CD spectra of peptides 1 b-1 e were taken at 1 0 °C intervals from 7 °C (280 K) to 57 °C(330K). 

30 Given that lc-le showed good retention of helicity in this temperature range, spectra of 1 c were taken up to 
97 °C (370 K), where some loss of helicity was observed (Figure 6). The molar ellipticity of 1 c at 97 °C and 
222 nm was still substantially more negative than that of the non-heiical control peptide lb at 7 °C and 222 



nm. 



Experiments to examine the effectsof heating and recoolingthe peptideswere complicatedby several 
35 factors: the CD spectrometershowed a baseline drift over long experiments; the concentration of the samples 
changed because of evaporation at higher operatingtemperatures; and there appeared to be some variation in 
sample behaviordependingon the rates of heating and cooling. A set of CD spectra of 1 c was acquired before, 
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during, and after heating at 87 "Cforoneday. The effect of baseline drift was reduced by linear normalization 
of the spectra based on & 2 45- ^ effect of concentration chan S e due to evaporation was corrected 

by normalizing the post-heating spectrum to the same amplitude as the pre-heating spectrum at wavelength 
(X) of 204 nm. This wavelength was chosen as the point where an « helix and a random coil have equal 

5 contributionsto the eliipticity, and hence interconversion of a peptide between these conformations will not 
affectthe magnitude of the eliipticity. The resulting spectra are shown in Figure 9. The close match in curve 
shape between the pre- and post-heating spectra indicated that most or all of the helical structure was regained 
on cooling after the partial denaturation induced by heating at 87 °C. The small difference in overall 
amplitude could be due to a small amount of permanent denaturation or could be an artifact of the 
10 normalizationprocedure. This experiment demonstrated that the a helix of lc was stable to relatively harsh 
conditions, a feature which improves its general utility. 
Conclusion 

A new method for constraining small peptides to an a helical conformation has been devised. This 
/ to /+7 amide-based tether is successful as a general method for inducing a helicity in small peptides and 

15 possessesseveraldesirablefeatures. First, it allows the maximum possible sequence variability. Any residue 
except the two tethering residues themselves may be changed. Second, the helicity induced by this method 
approaches 100% in aqueous solution at room temperature (RT). The comparison of helical peptides lc-le 
with non-helicalpeptide lb shows that the helicity is achieved by introduction of the linker rather than being 
a property of the primary sequence. Third, these tethered peptides are synthesized by standard solid-phase 

20 (Merrifield) chemistry and require only inexpensive.commercialiyavailablereagents. Fourth, the method can 
be used for peptides as short as eight residues. Fifth, it poses no chemical requirements as to environment and 
has been shown to induce good helicity despite changes in temperature and buffer conditions. This method 
is generallyusefulforstudiesof biologically active helical regions of proteins, for the experimental study of 

- helix formation,propagation, and stability, and for physical organic experimentson the interactions of helical 

25 peptides with their environments, — -— 

F.Y AMPLE 2 

The peptide cyclized peptide FNM(5)QQRRFY(6)ALH (Fig. 11) was synthesized using Fmoc 
chemistry with standard solid phase protocols in which Fmoc-glutamic acid, 6-(5-allyloxycarbony 1-1,5- 
diaminopentane){5) (synthesizedas described below) and Fmoc glutamicacid 6-aliyl ester (6) (commercially 
30 available from Millipore) are incorporated as standard amino acids in peptide synthesis, followed by 
cyclization as shown in Fig. 1 1. Fmoc-glutamic acid, 5-(5-allyloxycarbonyl- 1,5 -diaminopentane) (5) was 
synthesized as shown in Scheme 1 below. — 
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Scheme 1 



Boc-ON 
THF 



,NHBoc 



(D 



Atiyl 

Chlorofornnate 



HNAlloc^x^^ NHBoc 
(2) 



TFA 



(3) 



HNFmoq r X p|B|j (3) HNFmocyJ^ 



O 

HNFmoci^A 



OH 



o=< 

OH 



DCC 

HOBT 

DCM 




NHAIIoc 



Mono-t-butyloxycarbonyl (BOC) 1,5-pentanediamine was synthesized by using 1,5- 
diaminopentane (12.5 g, 122 mmot) in place of 1,3-diaminopropane in the synthesis of mono- 
al!yloxycarbonyH,3-diaminopropanalescribedin Example 1 above, yielding 10 g (49 mmol) of mono-tert- 
butyloxycarbonyl-l,5-diaminopentan<l). The mono-ter/-butyloxycarbonyi-l,5,diaminopentane (1) (5.8 g, 

5 28.7 mmol) was dissolved in 75 raL of dichloromethane with 7.5 mL of diisopropylethy lamine and cooled to 
0° C. A solution of allyl chloroformate (3.3 mL) in dichloromethane (25 mL) was added over five minutes. 
The reaction was allowed to warm to room temperature for one hour and then solvent was removed by rotary 
evaporation. The residue was dissolved in 100 mL of ethyl acetate and washed with three 100 mL portions 
of 10% citric acid, once with 100 mL saturated aqueous sodium bicarbonate and once with 100 mL of 

10 saturated aqueous sodium chloride. The organic phase was dried over magnesium sulfate and solvent was 
: removed by rotary evaporation. The resulting oil (2) was treated with 25 mL of trifluoroacetic acid for 30 
minutes. The trifluoroacetic acid was removed by rotary evaporation and the resulting reside was twice 
dissolved in dichloromethane and then evaporated to remove residual solvent. The residue was dissolved in 
50 mL of 3N hydrochloricacid and washed with two 50 mL portions of dichloromethane. The aqueous phase 

15 was cooled in an ice bath and the pH was adjustedto approximately 13 with 50% aqueous sodium hydroxide. 
The basic aqueous phase was extracted with three 100 mL portionsof dichloromethane, the combined organics 
were washed with 100 mL of saturated aqueous sodium chloride and then dried over potassium carbonate. 
The mixture was filtered, the solvent removed first by rotary evaporation and then by high vacuum to yield 
3.95 g of mono-ally loxycart>onyl- 1 ,5-diaminopentane (3) as a colorless oil. 

20 Fmoc-Glutamicacid, a-ferf-butyl ester, 9.0 g (21.1 mmoUBacfaem California) was dissolved in 100 

mL of dichloromethane.Dicyclohexylcarbodiimide(4.4 g, 21 .3 mol) and N-hydroxybenzotriazole (0.3 g, 2.1 
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mmol) was added to this solution, followed by the m ono-alIyIoxycarbonyl-l,5-diaminopentane (3) (3.95 g, 
21.2 mmol). The reaction was stirred at 25° C for 14 hours, then cooled to 0° C for one hour. Insoluble 
material was removed by filtration, and the filtrate was concentrated by rotary evaporation. The residue was 
dissolved in 150 mL of ethyl acetate and washed twice with 100 mL of 10% aqueous citric acid, twice with 

5 100 mLofsaturatedaqueous sodium bicarbonate and once with lOOmLbrine. After drying over magnesium 
sulfate and filtering the solvent was removed by rotary evaporation. The residue was dissolved in 
approximately 75 mL of ethyl acetate with heating and 2:1 hexanes:ethyl acetate was added until the solution 
became cloudy. After standing for several hours the crystalline precipitate was removed by filtration, the 
white crystals were washed with 2: 1 hexanes:ethyl acetate and dried under vacuum to yield 1 1 .4 g of (4) 

10 (90%). 

The tert -butyl ester (4), 1 1 g. 1 8.5 mmol) was dissolved in 50 mL of trifluoroaceticacid with stirring. 
After 45 minutes, the trifluoroaceticacid was removed by rotary evaporation;residual trifluoroacetic acid was 
removed by evaporation from 50 mL of dichloromethane three times. The residue was dissolved in 75 mL 
of ethyl acetate with heating, filtered through celite, and 3:1 hexanes:ethyl acetate was added until a haze 

1 5 developed. Crystals were allowed to grow at 25° C for three hours, then cooled to 0° C for one hour. The 
crystals were isolated by filtration and washed with 3:1 hexanesiethyl acetate, then dried under vacuum to 
yield 9.5 g (95%) of (5) as off white crystals. 

Followingpeptide synthesis of FNM(5)QQRRFY(6)ALH, the N-terminus of the solid phase peptide 
was coupled to mono /ert-butyl-succinic acid the allyl and allyloxycarbonyl protecting groups were removed 

20 using 500 mg Pd(PPh 3 ) 2 Cl 2 in 20 mL of 20% piperidine in dimethyl acetamide for 1.5 hours at room 
temperature. The resin was then washed with 20% piperidine in dimethyl acetamide, dimethyl acetamide, 
dichloromethaneand finally with 0.5% trifluoroacetic acid in dichloromethane. The resin was suspended in 
dichloromethane and 1.5 equivalents of HATU with 3 eq N,N-diisopropylethylamine in 5 mL of dimethyl 
acetamide was added. After two hours the resin was checked for free amines by ninhydrin test and found to 

25 be negative. The peptide was cleaved from the resin with 95% trifluoroacetic acid 5% triethylsilane and 
purified using reverse phase HPLC. . 

The helical structure of the cyclized peptide shown in Fig. 1 1 was confirmed by circular dichroism 
(CD) and nuclear magnetic resonance (NMR). Both of these methods indicated that the locked helix peptide 
displayed predominantly a-heltcal character. The locked helix peptide was determined to bind IgG with an 

30 affinity (Kd) of approximately 1 00 uM both by microcalorimetry and surface plasmon resonance. A control 
peptide lacking the locking portion of the molecule did not exhibit IgG binding detectable by 
microcalorimetry. 

EXAMPLE 3 

To confirm that the covalent locking mechanism is fully functional and that peptides constrained by 
35 this technique are able to bind ligand with high affinity, a 33 amino acid peptide based on helix 1 of the Z 
domain of protein A was synthesized with the i to i + 7 linkage as shown in Scheme 2 below: 
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Scheme 2 

suc-FNM (5) QQAKFY (6) ALHDPNLNEEQRNAKIKSIRDD^nh2 




where (5) and (6) are the allyloxycarbonyl and allyl protected amino acids described in Example 2 above. 
The peptide was synthesized and cyclized as described in Example 2 above. The helicity of the peptide 
was verified by CD and NMR, and thermal denaturation of the peptide as monitored by CD indicated that 
the peptide only partly unfolds at 90° C, consistent with the stability of the covalent linkage. The IgG 
binding affinity (Kd) of this peptide (as measured by surface plasmon resonance) was determined to be 
approximately 20 nM. 

EXAMPLE 4 

Linear peptides derived from the ectodomain of the HIV-1 envelope protein gp41 are known to 
inhibit viral fusion events. The most potent of these (DPI 78) corresponds to a membrane proximal region 
of gp41, which is predicted to be ot-helical. However, DP178 itself lacks discemable structure in solution, 
rendering mechanistic interpretation of its activity difficult. By applying the helix locking chemistry 
taught herin, constrained versions of DP 178 were made to determine whether helicity is necessary or 
1 5 sufficient for its infectivity inhibition activty and to define a likely mode of action for this molecule in 
primary infection (as measured using viral infectivity assays). 

By constraining DPI 78 analogs into a helical conformation we show that helicity is necessary, 
but not sufficient, for inhibitory potency. The correct face of the helix must also be exposed. Two recent 
crystal structures of gp41 indicate that this face is buried in a groove formed by a coiled-coil trimer. 
20 Taken together, these results indicate that DP178 inhibits infectivity by blocking this groove, and that the 
conformation of gp41 observed by crystallography represents the fusogenic state. 

A series of analogs of DPI 78 in which segments of the amide backbone were constrained to be 
helical (Figure 12) were prepared. Because short ct-helices are usually unstructured in solution (Marqusee 
etaLPrvc Natl. Acad Sci. USA 86:5286-5290 (1989)), a covalent crosslink between amino acid side 
25 chains at positions i and f+7 of the polypeptide chain as taught herein (see also Phelan et al., J, Am. Chem. 
Soc, \ 1 9:455-460 (1997), which is incorportated herein by reference) which lock the intervening residues 
into a stable a~helical conformation. 

A truncated form of DPI 78, designated HIV35 (Figure 12) was used as a reference. In the 
absence of detailed information regarding the associaton of DP178 with DP107, the coiled-coil 
30 propensities (Lupas et ah, Science 252:1 162-1 164 (1991)) for 29 distinct gpl60 sequences were computed 
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in order to determine whether the region corresponding to DP178 scored as a coiled-coil. The N-terminal 
27 residues of DP178, selected for the reference peptide HIV35, maintained a high overall score with a 
consistent heptad register. The "a-d" face predicted by the scoring algorithm corresponded to the face seen 
to pack against the trimer core. This corresponding region is entirely helical in the x-ray structures, and 

5 packs against the trimer core using a 4-3 heptad repeat akin to that found in coiled-coils. Using the helical 
locking chemistry and methods taught herin we enforced the exposure of this repeat (positions "a" and "d" 
of the heptad) by introducing crosslinks between pairs of adjacent residues on the opposite face of the 
heiix (position T ). Thus, "f '-to "f * (tethers) locks were made to constrain the potential helix. Analogs of 
HIV35 (Figure 12) were prepared containing either one (HIV24) or two (HIV31) tethers to impart 

10 increasing heiicity. A control peptide (HIV30) was prepared in which a tether was introduced between 
successive "d" residues to stabilize heiicity while blocking potential binding interactions across the "a-d" 
face. |. . 

i Linear peptides were synthesized according to standard solid phase techniques using Fmoc 
chemistry (Fields et al., Int. J. Peptide Protein Res. 35:161-214 (1990)) as taught herein. In particular, 
15 helix dipole effects were minimized by blocking the C-termini as amides and the N-termini as succinate 
groups. After formation of the lactam bridges as taught herein (see also Phelan et al., J. Am. Chem. Soc. 
1 1 9:445-460 (1997)), the peptides were cleaved from the resin and purified to homogeneity using 
preparative reversed phase HPLC with water/acetonitrile/0.1% TFA gradients in the mobile phase. The 
identity of each peptide was confirmed by electrospray mass spectrometry: HIV 24, calculated mass 
20 3396.8, observed, 3396.0; HIV 30, calculated mass 3413.7, observed, 3413.8; HIV 3 1 , calculated mass 
3520.0, observed, 3520.7; HIV 35, calculated mass 3330.8, observed, 3330.5. 

\ Circular dichroism analysis (Figure 1 3) confirms that the locking strategy markedly increases the 
heiicity of the DP178 truncations. CD spectra were recorded on an AVIV 62DS CD spectrometer using 
0.05 cm pathlength cuvettes. I Spectra were gathered by averaging data from three runs spanning 250 nm to 
25 190 mn in 1 .0 nm increments, with 2 second averaging time at each wavelength. Peptide concentrations 
were Approximately 200 uM in a solution of 10 mM Tris HC1 pH 7.5 with 6% acetonitrile (v/v). For 
conversion of raw data to molar ellipticity values, precise concentrations were determined by measuring 
A 276 Ind A 2&0 (Edelhoch, Biochemistry 6:1948-1954 (1 967)); these values were confirmed by 
quantitative amino acid analysis. 
30 The unconstrained peptide HIV35 has an almost featureless spectrum, similar to that reported for 

DPI 7)5 (Lawless, et al., Biochemistry 35:13697-13708 (1996)). The CD spectra of peptides containing a 
singli constraint (HIV24 and HIV30) display minima at 209 and 222 nm characteristic of oc-helices. The 
intensity ratios of these two regions are skewed from ideality, suggesting that regions of the peptide 
backbone outside the consttained segment are disordered. By constrast, the doubly-constrained analog 
35 HIV31 appears to be largely helical by CD, giving the shape and intensity profile of a typical a-helix. 

1 Viral infectivity assays were used to characterize the locked-helix constructs. Normal human 
peripheral blood mononuclear cells (PBMCs) were stimulated with phytohemagglutinin (PHA) in RPMI 
1640 medium containing interleukin 2 for 24 hours. The PHA medium was removed and the cells grown 
overnight in RPMI 1 640 with glutamine, 20% heat inactivated fetal calf serum, and gentamicin. At the 
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start of the assay, pre-titered virus stocks were equilibrated with peptides for one hour before adding to the 
PBMCs (2.5 x 10 5 cells per well). Cells were grown for three days, rinsed to remove extracellular virus 
and peptides, then supplemented with fresh medium and grown for an additional four days. After seven 
days the cells were iysed and p24 antigen was determined by ELISA. Peptides were run in triplicate at 
5 each concentration. Viral titers were determined in duplicate for each run. Each assay also included the 
following controls, in triplicate: Uninfected ceils as a negative control, infected cells without peptide as a 
positive control, and virus innoculum without cells to establish a baseline p24 level. Peptides were tested 
for cytotoxicity by incubating them at the highest assay concentration (approximately 1 00 uM) with 
uninfected cells and then growing the cells as described above. After 7 days the cell counts were estimated 
10 by microscopy and compared to an identical batch of cells which were not treated with the peptides; none 
of the peptides inhibited normal cell growth under these conditions. 

When tested in viral infectivity assays, the peptides displayed a striking pattern of relative 
potency that extended across both syncitium inducing (SI) and non-syncitium inducing (NSI) strains of 
HIV-1 (Zhang et ah, Nature 383:768 (1996)). As shown in Figures 14A and 14B, truncating the 
15 hydrophobic C-terminus of DPI78 (HIV35) caused a dramatic drop in activity, which was partially 
restored when a single restraint, i,e, constrained helical peptide, (and partial cc~helical character) was 
introduced (H1V24). Adding a second restraint (HIV3 1 ) imparted strong helical character and enhanced 
the potency of the peptide to levels comparable to DP178. Thus, the additional stabilization afforded by 
preorganizing HIV3 1 into an active helical conformation offset the loss of binding energy caused by 
20 deleting the C-terminus. By contrast, a single restraint that induced helicity while blocking the "a-d" face 
(HIV30) completely ablated activity. 

A series of shorter constrained peptides spanning positions 63 1 -644, 643-656, 649-662, 656-669, 
and 663-678 of HIV- 1 tethered between adjacent residues at the V ■ positions of the heptad, were 
prepared to determine whether a subset of HIV35 or its N- and C-terminal flanking regions was sufficient 
25 to block infectivity. All peptides, whether constrained or unconstrained, failed to show significant activity. 
Peptide 63 1 -644 contains the hydrophobic cluster observed in the x-ray structure to pack into a cavity in 
the trimer core (Chan et aL, Cell 89:263-273 (1997)). 

The relative activities of HIV35, HIV24, and HIV3 1 demonstrate a clear correlation between 
helicity and inhibitory potency. The widely disparate activities of HIV30 and HIV24 indicate that peptide 
30 inhibition also requires exposure of the face of the helix seen by crystallography to pack against the N- 
termuial trimer core of gp4 1 . 

The data presented herin, combined with prior model studies on isolated peptides and the recently 
published crystal structures, strongly support the hypothesis that the peptides inhibit viral infectivity by 
binding to the resting state of gp41 and preventing formation of the fusogenic state. Peptide HIV3 1 is 
35 conformationally constrained to be largely helical, and is likely to interact as such with an accessible 

cognate surface in the resting state of gp41 . Because x-ray analysis shows that the face of HIV3 1 required 
for inhibiting viral fusion is buried in the groove formed by the N-terminal trimer core, we believe 
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(without being bound to any particular theory) that this groove represents the cognate surface for the 
peptides. 

Figure 15 outlines schematically a current model for assembly of the fusgenic state of gp41, and 
the mechanism by which the constrained helices inhibit this process. The model is presented without 
5 meaning to be limiting to the invention and without binding the inventors to any particular theory of 
operation of the invention. The resting state of gp41 (upper left) is presumed to be constitutively 
trimerized, featuring a coiled-coil bundle near the N-teraiinal fusion peptide (arrow). The region 
corresponding to the C-terminus of the ectodomain (dark lines) is not initially bound to the trimer bundle, 
and has an unknown conformation. A conformational shift resulting from the binding of gpI20 to either 
10 CD4, a co-receptor, or both, may then allow association of the C-terminal portion of gp41 with the N- 
terminal bundle. The resulting antiparallel helical array (top right) observed in the x-ray structures is 
presumably the isogenic state of gp41 . Rearrangement to this state can be blocked if the trimer grooves 
are occupied by inhibitory peptides (bottom left). Once blocked in mis manner, a subsequent 
conformational shift in the gp4 1 cluster would sequester the protein off-pathway (bottom right). 
1 5 Peptides DP 1 78 and HI V24 effectively inhibit the inf ectivity of genetically distant and 

phenotypically distinct subtypes of HIV- 1 (Gao et al., Journal of Virology 70:1651-1667 (1996)). 
Moreover, the surface to which they are proposed to bind is one of the most highly conserved, regions in 
the HIV-1 genome. We have assayed DP 178 against other strains and found it to have similar inhibitory 
potency against the laboratory-adapted strain MN/H9 and primary isolates 301660 and Th009. Strain 
20 Th009 is from subtype E and is genetically distant from the predominant North American subtype B (e.g. 
JRCSF) (Zhang et al., Nature 383:768 (1996)). These results are in accord with observations from other 
labs (Wild et al., Proc. Natl. AcadSci. USA 89:10537-10541 (1992); Wild etal., Proc. Natl. Acad. Set 
USA 91: 9770-9774 (1994); Jiang et al.,. Nature 365:113 (1993)). In addition to JRCSF and BZ167, we 
tested HIV24 against Th009 and found it to have comparable potency, suggesting that the membrane 
25 fusion mechanism proposed extends to widely disparate strains of HIV- 1 . 

Other agents, such as antibodies, which target this surface may thus hold promise for the 
therapeutic treatment of AIDS. 

EXAMPLE 5 

To prepare a vaccine mat would be effective against HIV infection, either as a prophylactic or 
30 post-infection therapeutic (optionally in combination with anti-HI V drugs or other subunit vaccines), 

constrained a-helical peptides from the 633-678 region of gp41 were prepared and used as immunogens. 

Variants of HIV 24 were prepared with the sequence "Gly Gly Cys"at the C-terminus or "Cys 
Gly Gly" at the N-terminus. These peptides were conjugated to KLH using a heterobifunctional 
crosslinker such as 4-(N-Maleimidomethyl)-cyclohexane-l-carboxylic acid 3-sulfo-N-hydroxysuccinimide 
35 ester, available from Sigma, or its equivalent (e.g. "Sulfo-MBS" from Pierce). Immunizations were 
performed as described below. 

Polyclonal antibodies were generated in female guinea pigs (Hartley Strain from Simonson Labs) 
against KLH-conjugated HIV peptides. Fifty fig peptide in 250 uL PBS was emulsified with 250 uL 
Freund's adjuvant (complete adjuvant for the primary injection and incomplete adjuvant for all boosts). 
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Injections of 70-100 ug peptide/kg body weight were administered with a combination of subcutaneous 
and intramuscular sites in a three-week cycle. Bleeds were taken on the second and third weeks following 
each boost. 

Sera from immunized animals was loaded on a Protein A column to provide, on elution, purified 
5 total Ig. Antibodies selective for the locked helices were obtained by passing the total Ig pool over an 

affinity column containing support loaded with immobilized locked helices. This support was prepared by 
first reacting the cysteine-containing peptides described above (HIV 26, 27, 28, and 29) with 
biotin-maleimide (also from Sigma; N-biotinyl-N'-[6-maleimidohexanoyl]-hydrazide) to afford peptides 
biotinylated at either terminus. These peptides were loaded onto a resin pre-loaded with streptavidin 
10 (Pierce, "Ultralink Avidm") to provide the affinity gel described. 

The total Ig pool from the protein A column was passed over the appropriate affinity column (i.e. 
the one with the matching hapten immobilized). Nonspecific antibodies were eiuted in the flow-through 
and saved as negative controls. Specific antibodies were eiuted as from the Protein A column, dialyzed 

into assay buffer, and stored. 
15 Surprisingly, the antibody titers observed were quite high for g P 4l subunit peptides. This is 

particularly surprising since this region of gp41 (633-678) is not known in the art to generate HIV 
neutralizing antibodies. 

The affinity purified polyclonal antibodies are tested in the viral infectivity assays used to 
evaluate the peptides. The haptens used to generate polyclonal antibody preparations that inhibit 
20 infectivity are desirable immunogenic agents for use in a vaccine. Most preferred are candidates that elicit 
broadly cross-reactive antibodies able to neutralize a variety of diverse H1V-1 isolates in vitro. 

Candidate HIV-1 vaccines can be tested in available animal models, for example, in chimpanzees 
as described by Bennan et aL, J. Virol. 7:4464-9 (1992); Haigwood et al., J. Virol. 66:172-82 (1992) and 
Salmon~Ceronetal.,^ASito. and Human Retroviruses 11:1479-86 (1995) for gp 120 subunit vaccines. 
25 Most preferred are candidates that elicit broadly cross-reactive antibodies able to neutralize a variety of 
diverse HIV-1 isolates in these animal studies, providing protection from challenge by homologous and 
heterologous strains of HIV-1 . Successful protection of chimpanzees is encouraging and has historically 
proved to be a reliable indicator of vaccine efficacy. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc. 

(ii) TITLE OF INVENTION: Constrained. Helical Peptides and Methods of 
5 Making Same 

(iii) NUMBER OF SEQUENCES: 1X3 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

10 (C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 
(F> ZIP : 94080 

{ v) COMPUTER READABLE FORM : 
15 (a) MEDIUM TYPE: 3.5 inch, 1-44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/876698 
25" " (B) FILING DATE: " 6/16/97 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/743698 

(B) FILING DATE : 11/6/96 .. . . 

(viii) ATTORNEY /AGENT INFORMATION: 
30 ... . (A); NAME: Torchia, PhD., Timothy E. 

<B) REGISTRATION NUMBER: 36,700 

(C) REFERENCE /DOCKET NUMBER: P1005R2PCT 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 650/225-8674 
35 (B) TELEFAX: 650/952^9881, 

(2) INFORMATION FOR SEQ ID NO : 1 : _ - - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: Amino Acid 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DP178 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-: 1 : 

Tyr Thr Ser Leu lie His Ser Leu lie Glu Glu Ser Gin Asn Gin 
1 5 10 15 
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Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
20 25 30 

Ser Leu Trp Asn Trp Phe 

35 36 _ 

5 (2) INFORMATION FOR SEQ ID NO ; 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

10 {xi) SEQUENCE DESCRIPTION: SEQ ID NO -.2; 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin 
1 5 10 15 

Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
20 25 27 

15 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Tyr Thr Ser Leu He His Ser Leu He Xaa Glu Ser Gin Asn Gin 
I 5 10 15 

Gin Xaa Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
20 25 27 

25 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids ' 
{B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

30 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Tyr Tnr Xaa Leu He His Ser Leu He Xaa Glu Ser Gin Asn Gin 
1 S . 10 15 

Gin Xaa Lys Asn Glu Gin Glu Leu Xaa Glu Leu Asp 
20 25 27 

35 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Tyr Thr Ser Leu He His Ser Xaa He Glu Glu Ser Gin Asn Xaa 
x 5 10 15 

Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
5 20 25 27 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
10 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6;. 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 ~ 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
15 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Xaa 
35 40 45 

" lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

20 Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu 

65 70 75 

■ - Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

25 - 95 100 ■■ 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 

, , ,-110 US s : 120 

Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

30 Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 

140 145 150 

Xaa Xaa He Trp Xaa Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155 160 165 

He Asp Asn Tyr Thr Xaa Leu He Tyr Thr Leu He Glu Glu Ser 
35 *" 170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 190 195 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Xaa He Thr Asn Trp Leu 
200 205 210 

-145- 



WO 98/20036 



PCT/US97/20069 



Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly 
215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

5 Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Xaa Leu Pro Ala Pro 

245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
26 0 265 26 9 

(2) INFORMATION FOR SKQ ID NO: 7: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: HIV-JRCSF 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
15 10 .15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

20 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 

35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
25 65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

Ala Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

30 Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

HO H5 120 

Gin Leu Met Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 

Thr Ala Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu Asp 
35 140 145 150 

Ser lie Trp Asn Asn Met Thr Trp Met Glu Trp Glu Lys Glu He 
155 160. 165 

Glu Asn Tyr Thr Asn Thr He Tyr Thr Leu He Glu Glu Ser Gin 
170 . 175 : 180 
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He Gin Gin Glu . Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
185 19° 1^5 

Trp Ala Ser Leu Trp Asn Trp Phe Gly He Thr Lys Trp Leu Trp 
200 205 210 

5 Tvr He Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 
y 215 220 225 

Arg He Val Phe Ser Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 24 0 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Thr Arg 
10 ' ' 245 250 255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 268 

(2) INFORMATION FOR SEQ ID NO :B: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
20 1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala .-Val Gly He 
35 40 45 



25 Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 60 

Gly Ala - Ala Ser Met Thr. Leu Thr Val Gin. Ala Arg Leu Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
30 " 80 85 50 

Ala Gin Gin Arg Met Leu Gin Leu Thr Val Trp. Gly He Lys Gin 

.... 95 ; 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gly Asp Gin 
110 115 120 

35 Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 

125 130 "5 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp 
140 145 150 

Arg He Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
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155 160 165 

Asp Asn Tyr Thr Ser Glu lie Tyr Thr Leu He Glu Glu Ser Gin 
170 175 - ISO 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
5 185 190 . 195 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Lys Trp Leu Trp 
200 205 210 

Tyr He Lys lie Phe lie Met He Val Gly Gly Leu Val Gly Leu 
215 220 225 

10 Arg Leu Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg Gin 

230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro ..Ala Pro Arg 
245 250 -255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
15 260 265 268 

{2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 B amino acids 

(B) TYPE: Amino Acid 
20 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Gly Gly Gly Asp Met: Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 
25 20 25 30 . 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Leu 
35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

30 Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
35 95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
110 . 115 12 0 

Gin Leu Leu Glu He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 

. 125 " ■ .130 135 
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■Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asn 
140 145 150 

Gl-n lie Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
155 . 160 165 

5 Asp Asn Tyr Thr Ser Leu Xle Tyr Thr Leu lie Glu Glu Ser Gin 

170 175 ISO 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
X85 190 135 

Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr Asn Trp Leu Trp 
10 200 205 210 

Tyr lie Lys He Phe He Met He Val Gly Gly Leu Val Gly Leu 
215 220 225 

Argl He Val Phe Ser 1 Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

15 Glv Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Arg Arg 

245 250 255 

Glu Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 268 

(2) INFORMATION FOR : SEQ ID NO: 10: 

20 (i) SEQUENCE CHARACTERISTICS: 

t {A) LENGTH: 268 amino acids 
(B) .TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi} ; SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
t i 

25 Gly Gly Gly Asn Met Arg Asp Asn Trp -Arg Ser Glu Leu Tyr Lys 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys 
! 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Leu 
30 35 40 45 

Gly Ala Leu Phe Lett Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
f 55 55 60 

Gly Ala Ala Ser Leix Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
65 70 75 

35 Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Met Ala He Glu 

. - i 8b S5 90 

Ala Gin Gin His Met Leu Glu Leu Thr Val Trp Gly He Lys Gin. 

95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr . Leu Lys Asp Gin 
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110 



115 120 



Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu lie Cys Thr 
125 130 135 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Ser 
5 140 145 150 • 

Asp lie Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu lie 
155 ISO 1^5 

Asp Asn Tyr Thr Asn Leu lie Tyr Ser Leu lie Glu Asp Ser Gin 
170 . 175 180 

10 lie Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys 

185 190 195 

Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp 
200 205 " - 210 

Tyr lie Lys lie Phe He Met lie Val Gly Gly Leu lie Gly Leu 
15 215 220 225 

Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 .240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Gly Arg Arg 
24 5 250 255 

20 Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 

260 2 65 268 

(2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids ■■- 

25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

30 Tvr Lys Val Val Lys He Glu Leu Leu Gly Val Ala Pro Thr l*ys 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie 
35 40 - 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
35 ' 50 55 - 60 

Gly Ala Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser 
65 70. 75 

Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Gly 

80 85 . ... , .. 90 
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Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu 
95 100 105 

Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin 
110 3-15 120 

5 Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr 

125 130 135 

Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Glu 
140 145 150 

He Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu He Asp 
10 " 155 160 165 

Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Glu Ser Gin Asn 
170 175 18 0 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Gly Leu Asp Lys Trp 
185 190 195 

15 Ala Ser Leu Trp Asn Trp Phe Thr He Thr Asn Trp Leu Trp Tyr 

200 205 210 

He Arg He Phe He Met lie Val Gly Gly Leu Val Gly Leu Arg 
215 220 225 

He Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg Gin Gly 
20 230 235 240 

Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Pro Arg Gly 
245 250 . 255 

Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 267 

25 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 ' 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

35 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 

35 40 45 

He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala. Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Thr Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu 
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65 . 70 75 .. 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
.80 85 ^° 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
5 95 100 . 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 
110 H5 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

10 Tire Thr Thr Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 

140 145 . 150 

Asp Lys He Trp Gly Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155 . 160 165 

He Asp Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Glu Ser 
15 170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 190 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu 
200 205 210 

20 Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly 

215 220 225 

Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ser Gin 
25 ' * 245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 269 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
{D) TOPOLOGY: Linear 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 13 : 
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Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 " ~ 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 . 40 45 
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Gly Ala val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Ala Ala Met Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
65 _ 70 75 

5 Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu 

SO 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
10 HO 115 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asn 
140 145 150 

15 Lys He Trp Asp Asn Met Thr Trp He Glu Trp Asp Arg Glu He 

155 160 165 

Asn Asn Tyr Thr Ser He He Tyr Ser Leu He Glu Glu Ser Gin 
170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
20 185 130 1^5 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Lys Trp Leu Trp 
200 205 210 

Tyr He Lys lie Phe lie Met He Val Gly Gly Leu He Gly Leu 
215 '220 - ' 225 

25 Arg He Val Phe Ser Val Leu Ser He Val Asn Arg Val Arg Gin 

230 235 240 

" Glv Tyr Ser "Pro Leu Ser Phe Gin Thr. His Leu Pro Ser Ser Arg 
... 245 250 255 

Gly Pro Asp Arg Pro Gly Gly He Glu Glu Glu Gly Gly 
30 ~ 260 265 268 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 S amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser. Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
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20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Leu 
35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
5 50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He. Glu 
80 85 90 

10 Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
110 115 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
15 125 130 135 

Thr Thr Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu Asn 
140 145 ISO 

Glu He Trp Asp Asn Met Thr Trp Met Lys Trp Glu Arg Glu He 
155 160 165 

20 Asp Asn Tyr Thr His He He Tyr Ser Leu He Glu Gin Ser Gin 

170 175 ISO 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Ala Leu Asp Lys 
185 1S>0 155 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Lys Trp Leu Trp 
25 " 200 205 21° 

Tyr lie Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 

215 220 225 . 

Arg He Val Phe Val Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

30 Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Ala Gin Arg 

245 250 255 

Gly Pro Asp Arg Pro Asp Gly He Glu Glu Glu Gly Gly 
260 265 268 

(2) INFORMATION FOR SEQ ID NO: 15: 

35 (i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 26 7 amino acids 
(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : . SEQ ID NO: 15: 
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Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
! " 5 10 15 

Tyr Lys Val Val Thr lie Glu Pro Leu Gly Val Ala Pro. Thr Lys 
20 25 30 

5 Ala Lys Arg Arg Val Val Glxi Arg Glu Lys Arg Ala Ala He Gly 

35 40 45 

Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 
50 55 60 

Ala Ala Ser Val Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser 
10 65 70 75 

Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg . Ala He Glu Ala 
80 85 90 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu 
95 100 105 

15 Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin 

110 115 120 

Ueu Leu Gly Phe Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr 
125 13 0 : 135 

Thr val . Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Asp 
20 ' 140 145 150 

lie Trp Asn Asn Met Thr Trp Met Gin Trp Glu Arg Glu He Asp 
155 160 165 

Asn Tyr Thr Ser Leu He Tyr Ser Leu Leu Giu Lys Ser Gin Thr 
170 175 ISO 

25 Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp "Lys Trp 

185 190 195 

Ala Ser Leu Trp .Asn Trp Phe Asp He Thr Asn. Trp Leu Trp Tyr 
200 205 210 

lie Lys He Phe He Met He Val Gly Gly Leu Val Gly Leu Arg 
30 215 220 225 

He Val Phe Ala Val Leu Ser lie Val Asn Arg Val Arg Gin Gly 
230 235 240 

Tyr Ser Pro Leu Ser Leu Gin Thr Arg Pro Pro Val Pro Arg Gly 
245 250 255 

35 Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 

260 265 267 

(2} INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 268 amino acids 
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(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

Cxi)" SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
5 1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

10 Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 60 

Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
15 " 80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 H5 120 

20 Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 

125 130 * 135 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu 
140 145 150 

Gin He. Trp Asn His Thr Thr Trp Met Glu Trp Asp Arg Glu lie 
25 155 ISO .165 

Asn Asn Tyr Thr Ser Leu lie His Ser Leu He Glu . Glu Ser Gin 
170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
1B5 190 195 

30 Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp 

200 205 210 

Tyr He Lys Leu Phe He Met He Val Gly Gly Leu Val Gly Leu 
215 220 225 

Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 
35 ~ 230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Thr Pro Arg 
245 250 255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 

260 : 265 . . 263 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 268 amino acids • 

(B) TYPE: Amino Acid 

5 (D) TOPOLOGY: Linear ~ 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
* * 5 10 IS 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
10 ' 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

15 Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

~ Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin 
20 95 100 105 

Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 115 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 13 0 13 5 

25 " Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu 

140 145 150 

- Gin lie Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu lie 

155 16 0 1^5 

Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin 
30 " 170 175 ISO 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
185 190 195 



Trp Ala Ser Leu Trp Asn ; Trp Phe Asn He Thr Asn Trp Leu Trp 
200 205 210 

35 Tyr He Lys He Phe lie Met He Val Gly Gly Leu Val Gly Leu 

215 220 225 

Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Thr Pro Arg 
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245 



250 . 255 



Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly 
260 265 268 

(2 ) INFORMATION FOR SEQ ID NOTlS: : 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY; Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 18: 

10 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 ~ ~ 5 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys 
~ * 20 25 3 0 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie 
15 " 35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Cys Thr Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

20 Ser Asp He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 

80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg lie Leu. Ala Val Glu Arg Tyr Leu Lys Asp Gin 
25 110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 



Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu 

140 145 150 

30 Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He 

155 160 165 

Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin 

170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 

35 185 190 195 

Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp 

200 205 310 

Tyr He Lys Leu Phe He Met He Val Gly Gly Leu Val Gly Leu 

215. ' 220 225 
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Arq He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 
230. 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro lie Pro Arg 
245 250 255 

5 Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 

260 265 268 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Gly iGly Gly Asp Met! Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
! i 5 10 15 

15 Tyr : Lys Val Val Lys He Glu Thr Leu Gly Val Ala Pro Thr Lys 

20 25 3 0 

Ala :Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
I 35! 40 45 

Gly I Ala Leu Phe Levj Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
20 *" 50" 55 60 

Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
; 65 70 75 

Ser 1 Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 9° 

25 Ala : Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
, 95 100 105 

• i t „. 

Leu^Gln Ala Arg lie Leu Ala Val Glu Arg Tyr -Leu Lys Asp Gin 
1 110 115 *- 20 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
30 125 130 135 

Thr! Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu 
I 140 145 150 

Glnj Phe Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He 
155 160 

35 Asn Asn Tyr Thr Ser Leu lie His Ser Leu He Asp Glu Ser Gin 

170 175 180 

!•• . : -■ • 
Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
185 190 

Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp 
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200 



205 210 



Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly Leu 
215 220 225 

Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 
5 ~ 230- 235, .240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Asn Arg Gly 
245 250 255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 . 265 268 

10 (2) INFORMATION FOR SEQ ID NO;20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 
* (B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 "* 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

20 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Ala 

35 40 45 

He Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 . 55 60 

Met Gly Ala Val . Ala Leu Thr Leu Thr Val Gin Thr Arg Gin Leu 
25 65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

30 Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 

HO H5 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
35 140 145 150 

Asp Lys lie Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155- 160 165 

He Asp Asn Tyr Thr Asn Leu He Tyr Thr Leu He Glu Glu Ser 

• ... 170 - 175 ISO. 
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Gin Asn Gin Gin Glu Lys Asn Glu Lys Asp Leu Leu Glu Leu Asp 
185 1^0 195 

Thr Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Asn Trp Leu 
200 205 210 

5 Trp Tyr He Lys He Phe He Met He He Gly Gly Leu He Gly 

215 220 225 

Leu Arg He Val Phe Thr He Leu Ser Leu Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr. Arg Phe Pro Val Pro 
10 245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 269 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 9 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
20 1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 3 0 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Ala 
^35 40 45 

25 Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 

50. ,55 60 

Met-Gly Ala Ala Ser Met Ala- Leu Thr Val Gin Thr Arg Gin Leu 
65 70 75 

Met Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Lys Ala He 
30 80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 H5 120 

35 Gin Gin Leu Leu Arg lie Trp Gly Cys Ser Gly Lys Leu He Cys 

125 130 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
140 145 150 

Asp Lys He Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
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1'5 5 



160 165 



lie Asp Asn Tyr Thr Gly Leu lie Tyr Thr Leu lie Glu Glu Ser 
170 175 ... ISO 

Gin He Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
5 185 1^0 195 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Lys Trp Leu 
200 205 210 

Trp Tyr He Lys He Phe lie Met He Val Gly Gly Leu He Gly 
215 220 225 

10 Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg 

23 0 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Gin 
245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly 
15 * 260 265 269 

(2-) INFORMATION FOR SEQ ID NO : 22 : 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 9 amino acids 

(B) TYPE: Amino Acid 
20 (D) TOPOLOGY: Linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
25 * " 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
35 40 45 

lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

30 Met Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Leu Leu 

65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 85 90. 
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Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 . . 105. 

Gin .Leu Gin Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp 

■ no . . 115 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 13° 135 
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Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Thr Leu 
140 145 150 

Asp Met lie Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155 160 165 

5 lie Glu Asn Tyr Thr Gly Leu lie Tyr Thr Leu He Glu Glu Ser 

170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp 
185 190 1^5 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp He Ser Asn Trp Leu 
10 * 200 205 210 

Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly 
215 220 225 

Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

15 Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Ala Pro 

245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 • 265 269 

(2) INFORMATION FOR SEQ ID NO : 2 3 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 
{B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

25 Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Lys Leu Tyr Lys 
15 10 15 

- ~ Tyr Lys Val Val Lys lie: Glu . Pro Leu Gly . Val Ala Pro Thr Lys 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Lys Lys Arg Ala Val Gly Thr 
30 35 40 45 

lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu' 
65 70 75 

35 Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

80 85 90 

Glu Ala Gin Gin His Leu. Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

. Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gin. Asp 

-163- 



_S820036A1J_> 



WO 98/20036 



PCT/US97/2O069 



no 



115 120 



Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
5 140 145 150 

Asp Glu He Xaa Asn Asn Met Thr Trp Met Gin Trp Glu Arg Glu 
155 16 0 165 

lie Ser Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Glu Ser 
17 0 175 I 80 

10 Gin Asn Gin Gin Glu Lys Asn Glu Leu Glu Leu Leu Glu Leu Asp 

185 190 1^5 

Lvs Trp Ala Ser Leu Xaa Asn Trp Phe Asp He Thr Asn Trp Leu 
200 205 210 

Trp Ser He Lys He Phe He Met He Val Ala Gly Leu Val Gly 
15 215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Thr Pro 
245 250 255 

20 Arg Gly Pro Asp Arg Pro Gly Gly He Glu Glu Glu Gly Gly 

260 265 269 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 2 6 9 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 24: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lye 
1 5 10 15 

30 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Leu 
35 40 45 

lie Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
35 ' ' J SO 55 60 

Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

Leu Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He 
80 85 90 
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Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 m US 120 

5 Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu Val Cys 

125 130 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
140 ' 145 150 

Asn Gin He Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
]0 155 160 165 

lie Asp Asn Tyr Thr Gly Leu lie Tyr Arg Leu He Glu Glu Ser 
170 175 180 

Gin Asn Gin Gin Glu Gin Asn Glu Gin Asp Leu Leu Lys Leu Asp 
185 1^0 195 

15 Thr Trp Ala Ser Leu Trp Asn Trp Phe Ser He Thr Lys Trp Leu 

200 205 210 

- Trp Tyr lie Lys He Phe lie Met He Val Gly Gly Leu He Gly 
215 220 225 

- Leu Arg He He Phe Ala Val Leu Ser lie Val Asn Arg Val Arg 
20 230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Pro 
245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 ' 265 269 

25 (2) INFORMATION FOR SEQ ID NO:25 : "" 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE; Amino Acid 
(D) TOPOLOGY; Linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

35 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Leu 

35 40 45 

Leu . Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Arg Ser Met Ala Leu Thr Val Gin Ala Arg Gin Leu 
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Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 85 50 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
5 95 100 : 1Q5 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 
110 US 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 I 30 135 

10 Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Met 

140 145 150 

Asp Met He Trp Asn Asn Met . Thr Trp Met Glu Trp Glu Arg Glu 
155 160 165 

He Asp Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Glu Ser 
15 170 175 1.80 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asn 
X85 190 1^5 

Lys Trp Glu Asn Leu Trp Ser Trp Phe Asp He Ser Asn Trp Leu 
ZOO 205 210 

20 Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly 

215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Ser Val Arg 
23 0 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Pro 
25 245 250 255. 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 269 

(2) INFORMATION FOR SEQ ID NO: 26 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A). LENGTH: 26 8 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



35 



Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
x " " ^ 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Thr Leu 

- 35 < 40 45 
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- Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 6° 

Gly Ala Arg Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
~ 65 , 7 ° 75 

5 Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 

80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
10 HO H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp 
140 145 150 

15 Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 

155 160 165 

Asp Asn Tyr Thr Asn Leu He Tyr Thr Leu He Glu Glu Ser Gin 
170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
20 185 190 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Ser Lys Trp Leu Trp 
200 205 210 

Tyr He Lys lie Phe He Met He Val Gly Gly Leu Val Gly Leu 
215 220 225 

25 Arg lie Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg Gin 

230 . 235 240 

Gly Tyr Ser Pro Leu Ser 'Phe Gin Thr Arg Phe Pro Ala Pro Arg 
245 250 255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
30 260 265 268 

(2)" INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:27: 

Ala Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys 
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20 25 2 0 

Ala Arg Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Met 
35 40 .45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
5 50 55 60 

Met Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 85 90 

10 Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

95 . 100 105. 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 115 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
15 125 130 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
140 145 150 

Asn Glu He Trp Asp Asn Met Thr Trp Met Gin Trp Glu Arg Glu 
155 160 165 

20 He Asp Asn Tyr Thr His Leu He Tyr Thr Leu Tie Glu Glu Ser 

170 175 ISO 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 190 1&5 

Lys Trp Leu Trp Ser Trp Phe Ser He Thr Asn Trp Leu Trp Tyr 
25 200 205 210 

He Arg He Phe He He He Val Gly Gly Leu Val Gly Leu Arg 
215 220 225 . 

He Val Phe Ala Val Leu Ser lie Val Asn Arg Val Arg Gin Gly 
230 235 240 

30 Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Thr Gin Arg Gly 

245 250 255 

Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 267 

(2) INFORMATION FOR SEQ ID NO: 28: 

35 (i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH: 26 9 amino acids 

(B) TYPE: Amino Acid 
{D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2.8: , ■ 
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i 1 

Ala Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys 

5 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Val 

35 40 45 

lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Ala Ser lie Thr Leu Thr Val Gin Ala Arg Lys Leu 
10 65 70 75 

Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 
80 85 9 0 

Glu kla Gin Gin His! Leu Leu Gin Leu Thr Val Trp Gly lie Lys 
95' 100 105 

15 Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 

110 115 120 

Gin Gin Leu Leu Glyi.Ile Trp Gly Cys Ser Gly Lys Leu lie Cys 
125" 130 135 

Thr 'Thr Thr Val Pro! Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 
20 140 145 150 

Asp -Lys lie Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155- 160 165 

i - I 

He ! Asp Asn Tyr Thr ; Ser Leu He Tyr Thr Leu Leu Glu Glu Ser 

' . 170- 175 180 

• .n f • , ■ 

25 Gin : Asn Gin Gin Glu^ Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
*■ iSS: 190 195 

• l ■ - i 

Lys.krp Ala Ser Leu Trp Asn Trp Phe Ser He Thr Asn Trp Leu; - 
I 200= 205 210 

Trp 'Tyr He Arg lie Phe He Met He Val Gly Gly Leu He Gly 
30 215 220 225 

Leu I Arg He He Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
J 230; 235 240 

Gin ' Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu He Pro Ala Gin 
245 250 255 

35 Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Gly Gly Gly 
| 260j . . 265 269 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH: 26 9 amino acids 
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(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

si's 10 « 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Met 
35 40 45 

10 Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 

50 55 .60 

Met Gly Ala Thr Ser Met Ala Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

I5 so as 90 

Lys Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 115 120 

20 Gin Gin Leu Leu Gly Phe Trp Gly Cys Ser Gly Lys Leu He Cys 

125 130 135 

Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Thr Leu 
14.0 145 150 

Asp Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu 
25 " 155 160 165 

He Asp Asn Tyr Thr His Leu He Tyr Thr Leu He Glu Glu Ser 

170 175 180 

Gin Asn Gin Gin Glu Lys Asn Gin Gin Glu Leu Leu Gin Leu Asp 
185 190 1^5 

30 Lys Trp Ala Ser Leu Trp Thr Trp Ser Asp He Thr Lys Trp Leu 

200 205 . 210 

Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly 
215 220 225 

Leu Arg lie Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
35 ~ 230 235 , 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Asn Pro 
245 250 255 

Arq Gly Pro Asp Arg Pro Glu Gly Thr Glu Glu Gly Gly Gly 
260 265 269 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 9 amino acids 

(B) TYPE: Amino Acid 
5 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tvr Lys Val lie Lys lie Glu Pro Leu Gly lie Ala Pro Thr Lys 
10 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Val Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 SO 

15 Met Gly Ala Val Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu 

65 70 75 

' L eu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 

80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
20 95 100 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 
110 H5 120 

Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 I 35 

25 Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 

140 145 150 

Glu Asp Tie Tr^ Asp Asn Met _ Thr Trp" Met Gin Trp Glu Arg Glu 
155 160 1 SS 

He Asp Asn Tyr Thr Asn Thr He Tyr Thr Leu Leu Glu Glu Ser 
30 " ' 170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 190 l g5 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Ser He Thr Asn Trp Leu 
200 205 210 

35 Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly 

215 220 ■•" 225 

Leu Arg He Val Phe Ala Val Leu Ser lie Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg. Leu Pro Val Pro 
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245 



250 .255 



Arg Gly Pro Asp Arg Pro Asp Gly lie Glu Glu Glu Gly Gly 

260 265 269 . 

(2) INFORMATION FOR SEQ ID NO: 31: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
{D) TOPOLOGY: Linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

10 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Gin Leu Tyr Lys 
l 5 10 15 

Tyr i,y S val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
20 25 30 



Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 



50 



55 . 60 



Met Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Lys Leu 
65 70 75 

20 Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 1°5 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 
25 HO H5 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

Thr Thr Ala Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Met 
140 145 150 

30 Glu Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Glu Lys Glu 

155 160 .165 

He Asp Asn Tyr Thr Asn Thr He Tyr Thr Leu Leu Glu Glu Ser 
170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
35 185 190 195 

Lvs Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu 
200 205 210 

Tm Tvr He Lys He Phe He Met He Val Gly Gly Leu Val Gly 

215 ■ 220 " . 225 
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Leu Arg He Val Phe Ala Val Leu Ser Val Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Thr Pro 
" ' 245 " 250 255 

5 Arg Gly Pro Asp Arg Pro Asp Gly He Glu Glu Glu Gly Gly 

260 255 269 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 269 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Asn Glu Leu Tyr Lys 
1 5 10 15 

15 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Met 
35 40 4 5 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
20 ' 50 55 60 

Met Gly Ala Arg ser Leu Thr Leu Thr val Gin Ala Arg Gin Leu 
65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He- 
80 as so 

25 Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg "Tyr Leu Arg Asp 

1X0 115 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
30 125 130 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Thr Leu 
140 145 150 

Asp Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155 160 155 

35 He Asp Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Gin Ser 

170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu' Asp 
185 190 195 

Lys Trp Ala Ser Leu Trp Ser Trp Tyr Asp He Ser Asn Trp Leu 
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200 



205 210 



Trp Tyr lie Lys lie Phe He Met He Val Gly Gly Leu lie Gly 
215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
5 230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Thr 
245 250 255 

Arg Gly Pro Arg Gin Pro Glu Glu He Glu Glu Glu Gly Gly 
260 265 269 

10 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 269 amino acids ' 7 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 ■ 3-5 

Tyr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

20 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 

35 40 45 

He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Gly Ser He Thr Leu Thr Val Gin Ala Arg His Leu 
25 " 65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

30- Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 

110 H5 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
35 140 145 X50 

Asn Met He Trp Asn Asn Met Thr Trp Met Gin Trp Glu Arg Glu 
: 155 " 160 165 

He Asp Asn Tyr Thr Gly He He Tyr Asn Leu Leu Glu Glu Ser 
170 ■ 175 ■ 180 
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Gin Asn Gin Gin 
Lys Trp Ala Asn 
5 Trp Tyr lie Arg 
Leu Lys lie Val 



Glu Lys Asn Glu Gin Glu 

185 190 

Leu Trp Asn Trp Phe Asp 

200 * 205 

He Phe He Met He Val 

215 220 

Phe Ala Val Leu Ser He 

230 235 



Leu Leu Glu Leu Asp 
195 

He Thr Gin Trp Leu 
210 

Gly Gly Leu Val Gly 
225 

Val Asn Arg Val Arg 
240 

Leu Pro Ala Pro 
255 

Glu Gly Gly 
269 



Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His 
10 245 250 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Gly 
260 265 

(2) INFORMATION FOR SEQ ID NO: 34; 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
20 1 5 10 15 

Tyr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 3 0 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
^35 40 45 

25 He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 

50 55 6 0 

Met Gly Ala Gly Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu 

65 "' 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 
30 80 85 90 

Asp Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp 
HO 115 120 

35 Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 

125 130 • 135 

Thr Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Met 
■ 140 145 150 

Asn Gin He Trp Asp Asn. Leu Thr Trp Met Glu Trp Glu Arg Glu 
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155 160 165 

He Asp Asn Tyr Thr Ser He He Tyr Ser Leu He Glu Glu Ser 
170 175 180 

Gin Asn Gin Gin Gly Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
5 185 190 1^5 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Asn Trp Leu 
200 205 210 

Trp Tyr lie Lys He Phe He Met He Val Gly Gly Leu He Gly 
215 220 225 

10 Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg 

230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Thr Pro 
245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
15 260 265 269 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 26 8 amino acids 

(B) TYPE: Amino Acid 
20 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 .15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Xaa 
25 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Gly Ala Ala Ser Pro Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 SO 

30 . Xaa Ala Ala Pro Thr Thr Leu Thr Val Gin Pro Arg Gin Leu Leu 

65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
35 95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 115 12 0 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 13 5 
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Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp 
140 145 150 

Glu He Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
155 " ISO 165 

5 Asn Asn Tyr Thr Gly Leu He Tyr Thr Leu He Glu Glu Ser Gin 

170 175 «0 

Xaa Gin Gin Glu Lys Asn Glu Leu Asp Leu Leu Glu Leu Asp Lys 
185 190 195 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Asn Xaa Leu Trp 
10 200 205 210 

Tvr He Lys He Phe He Met He Val Gly Gly Leu Val Gly Leu 
215 220 225 

Arcr He He Phe Thr Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

15 Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Xaa Pro Arg 

245 250 255 

Gly Pro Asp Arg Pro Gly Gly He Glu Glu Glu Gly Gly 
260 265 268 

(2) INFORMATION FOR SEQ ID NO: 36: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 269 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

25 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 • 5 10 15 

Tyr Lys Val Val Arg He Glu Pro Leu Gly He Ala Pro Thr Arg 
20 25 30 



Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 



50 



55 SO 



Met Gly Ala Ala Ser Val Ala Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

35 Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

80 35 90 

Qlu Ala Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gly Asp 
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110 



115 120 



Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

Thr Thr Thr Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 
5 140 145 150 

Asp Asp He Trp Thr Asn Met Thr Trp Met Glu Trp Lys Arg Glu 
155 160 165 

He Asp Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Glu Ser 
170 175 180 

10 Gin Arg Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 

185 190 195 

Lys Trp Asp Ser Leu Trp Asn Trp Phe Thr He Ser Lys Trp Leu 
200 " 205 210 

Trp Tyr He Lys He Phe He Met He Val Ala Gly Leu Val Gly 
i5 215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Lys Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Val Ser Phe Gin Thr Arg Leu Pro Ala Gin 
245 250 255 

20 Arg Gly Pro Asp Arg Pro Glu Glu He Glu Glu Glu Gly Gly 

260 265 269 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
x 5 10 15 

30 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 

... 20 25 30 

Pro Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
35 40 .45 

He Gly Ala Met Phe Leu Gly Phe Leu Gly Thr Ala Gly Ser Thr 

35 J 50 55 ... 60 

Met Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 .75 

Leu Ser Gly He Val Gin Gin Gin Arg Asn Leu Leu Arg Ala He 

80 - 85 , 90 
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Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 ■ H5 120 

5 Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 

125 130 135 

Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
140. 145 150 

Asp Lys He Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
10 ~ 155 ISO 165 

lie Asp Asn Tyr Thr Arg Glu He Tyr Thr Leu He Glu Glu Ser 
i 170 175 I 80 

Gin ksn Gin Gin Glu Lys Asn Glu Leu Glu Leu Leu Glu Leu Asp 
185 190 195 



15 



Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Lys Trp Leu 
200 205 210 



Trp W He Lys lie! Phe He Met He Val Gly Gly Leu Val Gly 
215 220 225 

Leu ' Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
20 230 235 240 

Gin j Gly Tyr Ser Pro Leu Ser Leu Gin Thr Arg Phe Pro Ala Gin 
245 250 255 

Arg 'Gly Pro Gly Gly Pro Glu Gly He Glu Glu Glu Gly Gly 
; 260 265 269 

• 1 ' ' ! • ' 

25 (2) INFORMATION FOR ; SEQ ID NO: 38: 

i • f 
(t) SEQUENCE CHARACTERISTICS: 

j (A) LENGTH: 267 amino acids 

(B) TYPE: Amino Acid 
"? (D) TOPOLOGY: Linear 

30 (X,i) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
a! 5 10 15 

Tvr Lys Val Val Lys He Glu Pro Leu Gly lie Ala Pro Thr Lys 
20 25 30 

35 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
1 35 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Ala Ala Met Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 

-179- 



BNSOOCID: <WO 9820038A1 _l_> 



WO 98/20036 PCT/US97/20069 



65 



70 75 



Thr Gly lie val Gin Gin Gin Asn Asn Leu Leu Lys Ala lie Glu 
80 S5 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
5 95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu lie Cys Thr 
125 130 

10 Thr Thr Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu Asp 

140 145 150 

Lys He Trp Gly Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
155 160 165 

Asp Asn Tyr Thr Gly Leu He Tyr Thr Leu He Glu Glu Ser Gin 
j5 " * 170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
185 190 195 

Trp Ala Ser Leu Trp Asn Trp Phe Thr He Thr Asn Trp Leu Trp 
200 . 205 216 

20 Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 

215 220 225 

Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Pro Arg 
25 " 245 250 255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly 
26 0 265 . 267 

(2} INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS; 
30 (A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Gly Gly Gly Asp Met Arg Glu Asn Trp Arg Ser Glu Leu Tyr Lys 
35 i ' " 5 10 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 3° 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Phe 
35 . 40 45 
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Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
65 - 70 75 

5 Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 

80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
10 110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 1^5 

Thr Asn Val Pro Trp Asn Lys Thr Trp Ser Asn Lys Ser Leu Asp 
140 145 150 

15 Gin He Trp Gin Asn Met Thr Trp Met Gin Trp Glu Arg Glu He 

155 160 I 65 

Asr> Lys Tyr Thr Asp Val He Tyr Thr Leu He Gly Glu Ser Gin 
170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
20 185 190 1^5 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Gin Trp Leu Trp 
200 205 210 

Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly Leu 
■ 215 " 220 225 

25 Arg lie Val Phe Ser Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Ala Arg • 

245 250 255 

Gly Pro Asp Arg Pro Glu- Gly lie Glu Glu Glu Gly Gly 
30 ~ ~ 260 265 268 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS :■ 

(A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: 

Gly Gly Gly Asn Met Lys Asp 
1 * 5 

Tyr Lys Val Val Arg lie Glu 



SEQ ID NO; 40 : 

Asn Trp Arg Ser Glu Leu Tyr Lys 
10 15 

Pro Leu Gly He Ala Pro Thr Arg 
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20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Ala Gly Leu 
35 40 45 

Gly Val Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
5 50 55 60 

Gly Ala Ala Ser He Ala Leu Thr Val Gin Ala Arg Gin Leu Leu 

7D 75 



10 



65 



Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 95 90 

Ala Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
110 U5 I 20 

Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
15 125 130 135 

Thr Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser His Asp 
140 145 150 

Gin He Trp Gin Asn Met Thr Trp Met Gin Trp Glu Lys Glu He 
155 160 1S5 

20 Asp Asn Tyr Thr Ser Leu He Tyr Asn Leu He Glu Val Ser Gin 

170 175 18° 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
185 190 195 

Trp Ala Ser Leu Trp Asn Trp Phe Asp Tie Thr Asn Trp Leu Trp 
25 200 205 . 210 

Tyr He Lys He Phe He Met He. Val Gly Gly Leu He Gly Leu 
215 220 225 

Arg He Val Phe He Val Leu Ser He Val Asn Arg Val Arg Gin 
230 235 240 

30 Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Ala Arg Arg 

245 250 ; 255 

Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
260 265 268 

(2) INFORMATION FOR SEQ ID NO: 41: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 
<B) TYPE: Amino Acid - 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 41: 
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Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 ^ 5 10 15 

Tvr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 - 25 30 

5 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 

35 40 45 

Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
10 €5 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 2-05 

15 Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

110 H5 120 

Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 

Thr Thr Val Pro Trp Asn Thr Ser " Trp Ser Asn Lys Ser Leu Asp 
20 140 145 150 

Gin He Trp Gly Asn Met Thr Trp Met Gin Trp Glu Arg Glu He 
155 160 165 

Asp Asn Tyr Thr Gly Leu lie Tyr Thr Leu lie Glu Glu Ser Gin 
170 175 180 

25 Asn Gin Gin Glu Lys Asn Glu Gin "Glu Leu Leu Glu Leu Asp Lys 

185 1^0 1^5 

-- Trp Ala Ser Leu- Trp -Asn Trp- Phe Asn lie Thr Asn Trp Leu Trp 

200 205 210 

Tyr tie Lys He Phe He Met He Val Ala Gly Leu Val Gly Leu 
30 ' 215 220 225 

Arg Val Val Phe He Val Leu Ser He Val Asn Arg Val Arg Gin 

230 . 235 • 240 

Glv Tvr Ser Pro Leu Ser Phe Gin Thr His His Pro Ala Leu Arg 
245 250 255 

35 Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 

.... 260 265 268 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 
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(B) TYPE: Amino Acid 
{D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

5 I . 5 ... 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 4 5 

10 Val Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 

SO 55 . SO 

Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu 
€5 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
15 80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
HO 115 I 20 

20 Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 

125 130 135 

Thr Thr Thr Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 
140 145 150 

Ser Glu He Trp Asp Asn Met Thr Trp Met Gin Trp Glu Arg Glu 
25 155 150 165 

He Asp Asn Tyr Thr Ser Leu He Tyr Thr Leu He Glu Glu Ser 
170 175 130 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 190 195 

30 Lys Trp Ala Gly Leu Trp Asn Trp Phe Glu He Thr Asn Trp Leu 

200 205 210 

Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly 
215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
35 ~ 230 235 240 

Gin Gly Tyr Ser Pro Val Ser Phe Gin Thr His Leu Pro Ala Pro 
245 250 255 . 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 

260 .... 265 .. - 269 
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(2) INFORMATION FOR SEQ ID NO; 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
5 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43 : 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1*5 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 
10 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Ala 
35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

15 Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu 

65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 35 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
20 95 100 105 

Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 H5 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
125 130 135 

25 Thr Thr Ala Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 

140 145 150 

Glu Lys lie Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
155 160 " " 165 

He Asp Asn Tyr Thr Gly Leu He Tyr Ser Leu He Glu Glu. Ser 
30 170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Glu Leu Asp 
IBS 190 1^5 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Lys Trp Leu 
200 205 210 

35 Trp Tyr He Lys He Phe lie Met He Val Gly Gly Leu He Gly 

215 220 : , 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg ' 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Gin 
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245 250 255 

Arg Gly Pro Asp Arg Pro Gly Gly He Glu Glu Glu Gly Gly 
260 265 263 

(2) INFORMATION FOR SEQ ID NO: 44: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

10 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 X0 15 

Tyr Lys Val Val. Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Lys Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Val 
15 35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 SO 

Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

20 Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg He Leu Ala Met Glu Arg Tyr Leu Lys Asp 
25 110 H5 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu lie Cys 
125 130 135 

Thr Thr Ala Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 
140 145 150 

30 Glu Lys He Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu 

155 160 165 

He Asp Asn Tyr Thr Gly Leu He Tyr Ser Leu He Gly Glu Ser 
170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Glu Leu Asp 
35 185 ISO 1^5 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr Lys Trp Leu 

200 205 210 : , 

Trp Tyr He Lys He Phe lie Met He Val Gly Gly Leu Val Gly 

215 • < 220 225 
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Leu Arg lie Val Phe Ala val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Gin 
245 - 250 255 

5 Arg Gly Pro Asp Arg Pro Gly Gly He Glu Glu Glu Gly Gly 

260 265 269 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2 69 amino acids 
10 (b) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(XX ) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Gly Gly Gly Asp Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
x 5 10 -15 

15 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 

---- - Ala Lys Ser Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Val 

35 40 45 

Leu Gly Ala Met Phe Leu Gly Leu Leu Gly Ala Ala Gly Ser Thr 
20 ' 50 55 60 

Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

Xeu Ser Gly lie Val" Gin Gin Gin Asn Asn Leu Leu Arg Ala He 

80 " ~ " S5 SO 

25 Glu Ala Gin Gin His Leu Ser Gin Leu Thr Val Trp Gly He Lys 
" 95 100 105 

• • Gin Leu Gin Ala -Arg Val -Leu Ala- Val Glu Arg Tyr Leu Lys Asp 

110 : " H5 ' 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 
30 125 130 135 

Pro Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Arg Ser Leu 
140 • 145 — 150 

Gin Tyr He Trp Asn Asn Met Thr Trp He Glu Trp Glu Arg Glu 
155 160 165 

35 He Asp Asn Tyr Thr Asp lie He" Tyr Ser Leu He Glu Lys Ser 

170 175 18° 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 1^0 1^5 

Gin Trp Ala Ser Leu Trp Asn Trp Phe Ser He Thr Lys Trp Leu 
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200 205 210 

Trp Tyr He Lys Leu Phe He Met He Val Gly Gly Leu He Gly 
215 220 225 

Leu Arg He Val Phe Ala He Leu Ser He Val Asn Arg Ala Arg 
5 230 235 24.0 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Pro 
245 25 0 25 5 

Arg Gly Leu Asp Arg .Pro Glu Gly He Gly Glu Glu Gly Gly 
260 265 269 

10 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids. ....... 

(B) TYPE: Amino Acid 
{D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 : 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
! 5 10. 15- 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

20 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 

35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Ala Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu 
25 65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
80 85 9.0 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

30 Gin Leu Glh Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 

110 115 120 

Arg Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Pro He Cys 
125 130 135 



Thr Thr Ser Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Leu 

35 140 145 , 150 

Glu Gin lie Trp Asn Asn Met Thr Trp Leu Glu Trp Glu Arg Glu 

■ 155 160. . 165 

He Asp Asn Tyr Thr Ser Leu He Tyr Ser Leu He Lys Glu Ser 

■170 - ' : 175 • 180 
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Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp 
185 19° 195 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr Glu Trp Leu 
200 205 210 

5 Trp Tyr lie Lys lie Phe lie Met He Val Gly Gly Leu Val Gly 

215 220 225 

Leu Arg lie Val Phe Thr Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg. Leu Pro Ala Pro 
10 " 245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly 
2S0 265 269 

i 

(2)* INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
I (D) TOPOLOGY: Linear 

'(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
20 1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Val 
35 40 45 

25 lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 

SO 55 60 

i - ;■ 

Met Gly Ala Ala Ser Met Ala Leu Thr Val Gin Ala Arg Gin Leu 
i 65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
30 80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
i 195 100 105 

din Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 H5 120 

35 Gin Gin Leu Leu Gly He Trp Gly eys Ser Gly Lys Leu He Cys 
i 125 I 30 135 

Thr Thr Ala Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 
140 145 150 

Glu Glu He Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu 
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155 160 . 165 

He ash Asn Tyr Thr Gly Leu He Tyr Thr Leu He Glu Gin Ser 
170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Ala Leu Asp 
5 18 5 190 195 

Thr Trp Ala Ser Leu Trp Asn Trp Phe Ser He Ser Asn Trp Leu 
200 205 210 

Trr> Tyr He Arg He Phe He Met He Val Gly Gly Leu Val Gly 
215 220 225 

10 Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 

230 235 .240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Thr Pro 
245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
15 " 260 265 269 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: Amino Acid 
20 CD) TOPOLOGY; Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4S: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 . 10 15 

Tyr Lys Val Val Lys He Glu Pro He Gly Val Ala Pro. Thr Lys 
25 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
35 40 45 

He Gly Val Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

30 Met Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu 

65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 
80 ■ 85 90 

Lys Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
35 J 95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala He Glu Arg Phe Leu Arg Asp 

110 . . ■ 115 ' 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys 

. 125 - , 13P . 135 
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Thr Thr Thr Val Pro Trp Asn Thr Ser Trp Ser Asn Lys Ser Leu 
140 145 150 

Lys Gin He Trp Asp Asn Leu Thr Trp Met Glu Trp Glu Arg (glu 
155 160 165 

5 He Asp Asn Tyr Thr Gly He He Phe Asn Leu He Glu Glu Ala 

170 175 180 

Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Glu Leu Asp 
185 1^0 135 

Lys Trp Ala Gly Leu Trp Asn Trp Phe Ser He Thr Asn Trp Leu 
10 200 205 210 

Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly 
215 220 225 

Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

15 Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro Thr Pro 

245 250 255 

Arg Gly Pro Asp Arg Pro Glu Gly Thr Gly Glu Glu Gly Gly 
260 265 269 

(2) INFORMATION FOR SEQ ID NO:49; 

20 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: Amino Acid 
<D) TOPOLOGY: Linear 

(3Ci) SEQUENCE DESCRIPTION: SEQ ID NO:49; 

25 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
■L ~ 5 10 15 

Tyr Lys Val Val Arg He Glu Pro Leu Gly lie Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
30 ' "* 35 40 45 

Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 " 55 60 

Glv Ala Ala Ser lie Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
65 70 75 

35 Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 

80 S5 .90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
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HO 1X5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
12 5 "0 135 . 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp 
5 140 145 150 

Gin He Trp Asp Asn Met Thr Trp Met Gin Trp Glu Arg Glu He 
155 160 165 

Glu Asn Tyr Thr Ser Leu He Tyr Asn Leu He Glu Glu Ser Gin 
170 175 180 

10 Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Glu Leu Asp Lys 

185 190 195 

Trp Ala Ser Leu Trp Ser Trp Phe Ser He Thr Asn Trp Leu Trp 
200 205 210 

Tyr He Arg He Phe He Met He Val Gly Gly Leu He Gly Leu 
15 215 220 225 

' Arg lie Val Phe Ala Val Leu Ser 

230 233 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 233 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

Gly Gly Gly Asp Met Arg Asp" Asn Trp Arg Ser Glu Leu Tyr Lys 
25 1 ' 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
'35 40 45 

30 Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
35 80 95 90 

Ala Gin Gin His. Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
110 115 120. 

-192- 



9B20D36A1J > 



WO 98/20036 



PCTYUS97 J '2O069 



Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 

Thr Ala val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Leu Asp 
140 . . 145 150 

5 Gin He Trp Asn Asn Met Thr Trp Met Gin Trp Glu Arg Glu Tie 

155 160 165 

Glu Asn Tyr Thr Ser Leu He Tyr Asn Leu He Glu Glu Ser Gin 
170 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
10 185 190 X95 

Trp Ala Ser Leu Trp Asn Trp Phe Ser He Thr Asn Trp Leu Trp 
200 205 210 

Tyr He Lys He Phe lie He He Val Gly Gly Leu He Gly Leu 
215 220 225 

15 Arg He Val Phe Ala Val Leu Ser 

230 233 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 
20 (B) TYPE: Amino Acid 

<D) TOPOLOGY: Linear 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: SI: 

Ala Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

25 Tyr Lys Val He Gin He Glu Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 -" 40 45 

Gly Ala Val Leu Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
30 50 55 50 

Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

35 Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
.95 I 00 1 

Leu Gin Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gin 
110 115 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
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125 130 135 

Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asn 
140 .145 150 

Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Asp Arg Glu He 
5 " 155 .- 1^0 I 65 

Asn Lys Tyr Thr Asp Ser He Tyr Gin Leu He Glu Glu Ser Gin 
170 175 1 8 ° 

Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Lys Leu Asp Glu 
185 190 1^5 

10 Trp Ala Ser Leu Trp Asn Trp Phe Ser He Ser Lys Trp Leu Trp 

200 205 210 

Tyr He Lys He Phe He Met He Val Gly Gly Leu Val Gly Leu 
215 220 22S 

Arg He Val Phe Ala Val Leu Ser 
15 23 0 23 3 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: Amino Acid 
20 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Tnr Arg 
25 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Arg Arg Ala Val Gly Ala 
35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

30 Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu 

65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Lys Ala He 
80 85 90 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
35 95 100 

{2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 105 amino acids 
(B) TYPE: Amino Acid. 
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(D) TOPOLOGY: Linear 

{Xi> SEQUENCE DESCRIPTION: SEQ ID NO; 53: 

Gly Gly Gly Asp Met Lys Asp Asn Trp Arg Ser Lys Leu Tyr Lys 
X 5 1° 15 

5 Tvr Lys val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
35 40 .45 

lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
10 * 50 55 60 

Met Gly Ala Ala Ser lie Thr Leu Met Val Gin Ala Arg Gin Leu 
65 70 75 

Leu Ser Gly He Val Gin Gin Gin Arg Asn Leu Leu Arg Ala He 
80 S5 9° 

15 Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

95 100 

(2) INFORMATION FOR SEQ ID NO ; 54 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

i 5 10 • - - ■-: ^ 5 

25 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Arg Arg Ala Val Gly Ala 
35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
30 ' 50 55 60 

Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Lys Ala He 
„ . . . 8Q 85 90 

35 Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

95 100 1°5 

(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 105 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 55 : 

5 Gly Gly Gly Asp Met Lys Asp Asn Trp Arg Ser Lys Leu Tyr 

1 .5 1Q . 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr 

20. .25 ... 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly 
10 35 4Q 

lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 

5.0 ... ,.. ...... ,55 

Met Gly Ala Ala Ser lie Thr Leu Met Val Gin Ala Arg Gin 
65 70 

15 Leu Ser Gly He Val Gin Gin Gin Arg Asn Leu Leu Arg Ala 

80 „ 85 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He 
95 100 

(2) INFORMATION FOR SEQ ID NO : 56 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: Amino Acid 
{D) TOPOLOGY; Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

25 Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 

2Q 25 ■ 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie 
30 35 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
SO 55 . 60 

Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
.,65 70 75 

35 Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 

80 85 30 

Ala Gin Gin 

93 ; :■■ 

(2) INFORMATION FOR SEQ ID NO: 57; 
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. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Oly'Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
■ 1 5 .10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

10 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 

35 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 . 60 

Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Leu Leu Leu 
15 ~ "* 65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 35 90 

-.- Ala Gin Gin His 

94 

20 (2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:5S: 

Gly Gly Gly Asp Met Arg Asp Asn Trp- Arg Ser Glu Leu Tyr Lys • 
x 5 10 15 

Tyr Lys "Val' Val Arg Ile" : Glu Pro Leu Gly lie Ala Pro Thr Arg 
20 25 30 

30 Ala Lys Arg Arg Glu Val Gin Arg Glu Lys Arg Ala Val Gly Thr 

35 40 45 

Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 60 

Met Gly Ala Ala Ser Val Ala Leu Thr Val Pro Leu Arg Arg He 
35 * 65 70 75 

Arg Ser Cys Xaa 
79 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 7 9 amino acids 
{B} TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 59: 

5 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
x ■ * 5 10 15 

Tvr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Lys Arg Arg Val Val Gin Gly Glu Lys Arg Ala Val Gly Thr 
10 1 J " 35 40 45 

He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 55 . 60 

Met Gly Ala Arg Ser lie Thr Leu Thr Val Pro Leu Arg Arg He 
65 70 75 

15 Arg Ser Cys Xaa 

79 

(2} INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 9 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

25 Tyr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Thr 
35 40 4 5 

He Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
30 "~ " 50 55 SO 

Met Gly Ala Ala Ser He Thr Leu Thr Val Pro Val Arg Arg He 

65 70 ; 75 

Arg Ser Cys Xaa 
79 

35 (2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 55 amino acids 
(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



-198- 



9820096A1_1_> 



WO 98/20036 PCT/US97/20069 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 * " 5 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 
5 " ~ 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg . Glu Lys Arg Ala Val Gly Thr 
35 40 45 

lie! Gly Ala Met Phe Leu Gly Phe Leu Gly 
50 55 

10 (2) INFORMATION FOR SEQ ID NO: €2: 

fi) SEQUENCE CHARACTERISTICS: 

J (A) LENGTH: 55 amino acids 

} (B) TYPE: Amino Acid 

■ (D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

i .5 10 .. .. 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

20 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala He Gly Thr 

35 40 45 

He Gly Ala Leu Phe Leu Gly Phe Leu Gly 
50 55 

(2 ) 4 INFORMATION FOR SEQ ID NO: 63: 

25 U) SEQUENCE CHARACTERISTICS: 

1 (A) LENGTH: 1 54 amino acids 

j (B) TYPE: Amino Acid 

\ (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

30 Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
jL 5 10 15 

4 . .. 
i 

- Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 

| 20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Thr Met 
35 ~ 35 40 45 

4 - 
Gly Ala Leu Phe Leu Gly Phe Leu Gly 
50 54 

(2} : - INFORMATION FOR SEQ ID NO : 64 : 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 amino acids 
.(B) TYPE: Amino Acid . . 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
± 5 " 15 

Tvr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

10 Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Leu 

3 5 4 0 45 . 

Gly Ala Met Phe Leu Gly Phe Leu Gly 
50 54 

(2) INFORMATION FOR SEQ ID NO: 65: 

15 (i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 41 amino acids 
■(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

20 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 " 5 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Thr 
20 25-30 

Ala Lys Arg Arg Val Met Gin Arg Glu Lys Arg 
25 ~ 35 40 41 

{2) INFORMATION FOR SEQ ID NO: 66; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE : Amino Acid 
30 (D) TOPOLOGY: Linear 

■ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

10 15. 



35 



1 5 



Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Thr 

20 .... 25 " 30 

Ala Lys Arg Arg 
34 

(2) INFORMATION FOR SEQ ID NO: 67: 
(i.) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 270 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

5 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 * 5 10 15 

Tvr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Xaa 
10 ^ 35 40 45 

Leu Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
50 5-5 60 

Met Gly Ala Ala Ser lie Thr Leu Thr Val Gin Ala Arg Gin Leu 
65 70 75 

15 Leu Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He 

30 85 90 

Glu Ala Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
20 HO US 120 

Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu lie Cys 
125 130 135 

Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Xaa 
140 145 150 

25 Gin Ser Xaa He Trp Asp Asn Met Thr Trp Leu Gin Trp Asp Lys 

- - - 155 160 "165 

Glu lie Ser Asn Tyr Thr Xaa He He Tyr . Asn Leu He Glu Glu 
— 170 175 ISO 

Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu 
30 185 190 195 

Asp Lys Trp Ala Asn Leu Trp Asn Trp Phe Asp He Ser Asn Trp 
200 205 210 



Leu Trp Tyr lie Xaa He Phe He Met He Val Gly Gly Leu He 

215 220 225 

35 Gly Leu Arg He Val Phe Ala Val Leu Ser He He Asn Arg Val 

230 235 240 

Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Thr Pro Asn 

245 250 255 

Pro Arg Xaa Pro Asp Arg Pro Gly Arg He Glu Glu Glu Gly Gly 
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260 265 270 

(2) INFORMATION FOR SEQ ID NO: 68: 

(I) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 268 amino acids 
5 (B) TYPE; Amino Acid 

£D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Thr Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

iO Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Pro 

20 25 30 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Gly Ala Val Phe He Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
15 50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65-- 70 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
80 85 SO 

20 Ala Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 115 I 20 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
25 125 130 135 

Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Thr Gin Ser 

...... 140 145 15Q 

Glu- lie Trp Asn Asn Met Thr Trp Leu Gin Trp Asp Lys Glu He 

155 160 165 , 

30 Ser Asn Tyr Thr Asp He He Tyr Asn Leu He Glu Glu Ser Gin 

170 175 . 180 

He Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Ala Leu Asp Lys 
185 150 195 

Trp Ala Asn Leu Trp Asn Trp Phe Asp He Ser Lys Trp Leu Trp 
35 200 205 210 

Tvr He Arq He Phe He Met He Val Gly Gly Leu He Gly Leu 
Y • 215 ■ 220 225 

Arg He Val Phe Ala Val Leu Ser He He Asn Arg Val Arg Gin 

■ ■ 230 ■ 235 . -240 

-202- 



BNSDOCIO: <WO 982Q036A1J_> 



WO 98/20036 PCT/US9 7 /2O069 

Glv Tyr ser Pro Leu Ser Phe Gin He His Thr Pro Asn Pro Arg 
245 250 255 

Gly Pro Asp Arg Pro Glu Arg He Glu Glu Glu Gly Gly 
260 . 265 268 

5 (2) INFORMATION FOR SEQ ID NO -.69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE; Amino Acid 
(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Lys Ser Glu Leu Tyr Lys 
1 5 10 15 

Tvr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

15 Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Leu 

35 40 45 

- Gly Ala lie Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
20 65 70 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
SO 85 90 

Ala Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

25 Leu Gin Ala Arg Val Leu Ala Val Glu -Arg Tyr Leu Gin Asp Gin 

110 I 15 120 

Gin Leu Leu ' Gly -He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 135 

Thr Thr Val Pro Trp Asn ser Ser Trp Ser Asn Lys Ser Gin Glu 
30 140 145 150 

Asp He Trp Asn Asn Met Thr Trp Leu Gin Trp Glu Lys Glu He 
155 160 165 

Ser Ser Tyr Thr Gly He He Tyr Gin Leu He Glu Glu Ser Gin 
170 175 180 

35 Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu Leu Ala Leu Asp Lys 

185 190 195 

Trp Ala Asn Leu Asn Trp Phe Asn He Ser Asn Trp Leu Trp Tyr 
200 205 210 

He Arg Leu Phe Val He He Val Gly Gly Leu He Gly Leu Arg 

-203- 



BNSDOCID: <WO 9B2003SA1J_> 



* WO 98/20036 PCT/US97;20069 

2X5 220 225 

He Val Phe Thr Val Leu Ser He lie Asn Arg Val Arg Gin Gly 
230 235 240 

Tyr Ser Pro Leu Ser Phe Gin Thr Leu Ala Pro He Pro Glu Gly 
5 245 250 255 

Leu Gly Arg Pro Gly Arg He Glu Glu Glu Gly Gly 
260 265 267 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 268 amino acids 

(B> TYPE t Amino Acid 
(D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
15 1 ^5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 3 0 

Ala Lys Arg Arg Val Val Ala Arg Glu Lys Arg Ala He Gly Met 
35 40 45 

20 Gly Ala Phe Phe Leu Gly Phe Leu Gly Ala Ala . Gly Ser Thr Met 

50 ' 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Arg Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
25 * 80 85 90 

Ala Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys Gin 
95 • 100 105 

Leu Gin Ala Arg lie Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 115 120 

30 Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys He He Cys Pro 

125 130 135 

Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Gin Ser 
140 145 150 

Asp He Trp Asp Lys Met Thr Trp Leu Glu Trp Asp Lys Glu Val 
35 *" 155 16 0 .165 

Ser Asn Tyr Thr Gin Val He Tyr Asn Leu He Glu Glu Ser Gin 

• 170 . 175 ISO . 

Thr Gin Gin Glu He Asn Glu Arg Asp Leu I^eu Ala Leu Asp Lys 
- 185 190 1?5 
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Trp Ala Asn Leu Trp Asn Trp Phe Asp He Ser Asn Trp Leu Trp 
200 205 210 

Tyr lie Lys lie Phe Xle Met He Val Gly Gly Leu He Gly Leu 
215 . 220 225 

5 Arg He Val Phe Ala Val Leu Ser He He Asn Arg Val Arg Gin 

230 235 240 

Glv Tyr Ser Pro Leu Ser Phe Gin Thr Leu Thr His His Gin Arg 
245 250 255 

Glu Pro Asp Arg Pro Glu Arg He Glu Glu Gly Gly Gly 
10 ~ 260 265 268 

(2) INFORMATION FOR SEQ ID NO : 71 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
15 (t)) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
x ' 5 10 15 

. Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Ser Arg 
20 ^ 20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Gly Ala Val Phe Leu -Gly Phe Leu Gly Ala Ala . Gly-Ser Thr Met 
SO 55 SO 

25 Gly Ala Ala Ser He -Thr Leu Thr Ala Gin Ala Arg Gin Leu. Leu 

65 ■ 70 75 

._ ser Gly lie Val Gin Gin Gin Ser Asn .Leu Leu. Arg Ala lie Glu 

SO 85 90 

Ala Gin Gin His Met Leu Lys Leu Thr Val Trp Gly He Lys Gin 
30 95 100 105 

Leu Gin Ala Arg . Val Leu Ala Val Glu Arg. Tyr Leu Lys Asp Gin 
110 H5 12 0 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 l^O 135 

35 Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Met Asn 

140 145 150 

Glu lie Trp Asp Asn Met Thr Trp Leu Gin Trp Asp Lys Glu He 
155 ISO 165 

Ser Asn Tyr Thr Gin He. He Tyr Asn Leu lie Glu Glu Ser Gin 
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170 



175 180 



Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys 
X85 190 195 

Trp Ala Ser Leu Trp Asn Trp Phe Asp He Ser Arg Trp Leu Trp 
5 200 205 . . 210 

Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 
215 220 225 

Arg He Val Phe Ala Val Leu Ser Val He Asn Arg Val Arg Gin 
230 235 240 

10 Glv Tyr Ser Pro Leu Ser Phe Gin He Arg Thr Pro Asn Pro Lys 

245 250 255 

Glu Pro Asp Arg Leu Gly Arg He Asp Gly Glu Gly Gly 
260 265 268 

(2) INFORMATION FOR SEQ ID NO: 72: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: Amino Acid 
(D> TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

20 Gly Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Met 
25 ' 35 40 45 

Gly Ala Val Phe lie. Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Glv Ala Ala Ser He Thr Leu Met Val Gin Ala Arg Gin Leu Leu 
65 70 75 

30 Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 

80 B5 90 

Ala Gin Gin His Leu Leu Arg Leu Thr Val" Trp Gly He Lys Gin 
95 100 IO 5 

Leu Gin Ala Arg val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 110 115 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
125 130 .135 

Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Tyr Ser 

' 140 145 150 

-206- 



9920036A1J > 



WO 98/20036 



PCT/US97/20069 



Glu lie Trp Asp Asn Met Tto Trp Leu Gin Trp Asp Lys Glu lie 
155 I* 0 165 



Asn 



Asn Tyr Thr Glu Leu He Tyr Ser Leu He Glu Asp Ser Gin 



170 



X 75 180 



5 Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys 

185 19° 195 

Trp Ala Asn Leu Trp Asn Trp Phe Asp He Ser Asn Trp Leu Trp 
200 205 210 



10 



15 



Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 
215 220 225 

Arg lie He Phe Ala Val Leu Ser He He Asn Arg Val Arg Gin 
230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Thr Pro Asn Pro Arg 
245 250 255 

Gly Leu Asp Arg Pro Gly Arg He Glu Glu Glu Gly Gly 
260 265 258 



(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:73: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
! 5 10 

25 Tyr Lys Asp Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 



20 



25 



Ala Lys Arg Arg val- Val- Glu -Arg Glu Lys Arg Ala Val Gly Leu 
35 40 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
30 50 55 60 

Gly Ala- Ala Ser lie Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
2 _.- nt\ '5 



6 5 70 

Ser 



Gly He Val Gin Gin Gin Ser Asn Leu Leu Met Ala He Glu 
80 95 90 



35 



Ala Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys Gin 
95 100 1° 5 

Gin Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gin 



Leu 

110 



115 120 



Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
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125 130 



135 



Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Ser 
140 I 45 150 

Asp He Trp Asp Asn Met Thr Trp Leu Gin Trp Asp Lys Glu He 
5 155 160 165 

Ser Asn Tyr Thr Lys He He Tyr Ala Leu He Glu Glu Ser Ala 
170 * 175 180 

Asn Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys 
185 190 x95 

10 Trp Thr ser Leu Trp Ser Trp Phe Asp He Thr Lys Trp Leu Trp 

200 205 210 

Tyr lie Arg He Phe He Met lie Val Gly Gly Leu ile.GlyLeu 
215 220 225 

Arg He Val Phe Ala Val Leu Asn He He Asn Arg Val Arg Gin 
15 " 230 235 240 

Gly Tyr Ser 
243 

(2} INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 104 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
25 1 5 . 10 15 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 
20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Leu 
35 40 45 

30 Gly Ala Val Phe He Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 • 5.5 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
35 80 85 90 

Ala Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys 
95 100 104 

(2) INFORMATION FOR SEQ ID NO -.75: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Gly Gly Gly Asp Met Arg Asp. Asn Trp Arg Ser Glu Leu Tyr Lys 
.15 10 15 

Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Ser Arg 
20 25 30 

10 Ala Lys Arg Arg Val Val Trp Arg Glu Lys Arg Ala Val Val Glu 

35 40 . 45 

lie Gly Ala- Val Phe Leu Gly Phe Leu 

■ :j - 50 54 

(2) INFORMATION FOR SEQ ID NO: 76: 

15 fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 263 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY^: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

20 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 ■ 15 

Tyr Lys Val Val Glu lie Lys Pro Leu Gly Val Ala Pro Thr Xaa 
20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Xaa 

25 *■ 35 40 45 

lie Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 

■ " ' io 55 €0 

Met Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu 
j ~ 65 70 75 

30 Leu Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He 

80 85 . SO 

Glu Ala Gin Gin His Xaa Leu Gin Leu Thr Val Trp Gly He Lys 
95 100 105 

Gin Leu Gin Thr Arg Val Leu Ala He Glu Arg Tyr Leu Lys Asp 
35 no H5 120 

■ Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu He Cys 

125 130 135 

Thr Thr Xaa Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Gin 
140 145 150 
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Thr Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Asp Arg Glu 
155 160 3-^5 

He Ser Asn Tyr Thr Asp Tlir He Tyr Arg Leu Leu Glu Asp Ser 
170 1^5 180 

5 Gin Asn Gin Gin Glu Arg Asn Glu Lys Asp Leu Leu Ala Leu Asp 

185 190 195 

Ser Trp Lys Asn Leu Trp Asn Trp Phe Ser He Thr Asn Trp Leu 
200 205 2X0 

Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly 
10 ' ~ 215 220 225 

Leu Arg He He Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Thr Pro Asn Pro 
245 250 255 

15 Arg Gly Pro Asp Arg Leu Gly Arg He Glu Glu Glu Gly Gly 

260 265 269 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 196 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1^5 10 I* 

25 Tyr Lys Val Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
30 * 50 55 60 

Glv Ala Ala Ser Val Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 " -70 75 

Phe Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
80 85 ^O 

35 Ala Gin His Gly Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Thr Arg Val Leu Ala He Glu Arg Tyr Leu Lys Asp Gin 
110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
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125 



130 3-35 



Thr Ala Val Ala Trp Asn Ser Ser Trp Ser Asn Lys Ser Gin Ser 
140 145 150 

Asp He Trp Asp Asn Met Thr Trp Met Glu Trp Asp Arg Glu He 
5 " 155 160 155 

Ser Asn Tyr Thr Asp He He Tyr Lys Leu Leu Glu Asp Ser Gin 
170 3.75 190 

Asn Gin Gin Glu Lys Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser 
X85 190 195 

10 Trp Lys Asn 
19S 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 198 amino acids 
15 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 78: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

1 5 10 • I 5 ;• 

20 Tyr Lys Val Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Glu 

20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 
35 40 45 

Gly Ala Val Phe- Leu Gly Phe Leu Gly Ala Ala Gly„Ser Thr Met 
25 50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

65 : 70 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
80 85 50 

30 Ala Arg Gin Gly Met Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 I 05 

Leu Gin Ala Arg Val Leu Ala He Glu Arg Tyr Leu Gin Asp Gin 
110 115 120 

Gin Leu Leu Gly Leu Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
35 125 130 135 

Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Lys Thr 
140 145 150 

Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Asp Arg Glu He 
155 160 I 55 
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Ser Asn Tyr Thr Asp Thr .lie Tyr Lys Leu Leu Glu Asp Ser Gin 
170 175 180 

Asa Gin Gin Glu Lys Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser 
185 . 190 195 

5 Trp Asn Asn 
198 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys 
. ! " 5 10 15 

15 Tyr Lys Val Val Glu lie Lys Pro Leu Gly Val Ala Pro Thr Thr 

20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly lie 
35 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
20 50 55 60 

Gly Ala Ala Ser Met Thr Leu Thr val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Glu 
BO 85 S>0 

25 Ala Gin Gin Gly Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Thr Arg Val Leu Ala He Glu Arg Tyr Leu Lys Asp Gin. 

110 H5 120 

i 

Gin Leu Leu Gly He Trp Gly Arg Ser Gly Lys Leu He Cys Thr 
30 125 130 135 

Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Gin Thr 
• - 140 145 150 

Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Asp Arg Glu He 
155 160 165 

35 Ser Asn Tyr Thr Asp Thr He Tyr Arg Leu Leu Glu Asp Ser Gin 

170 175 180 

Asn Gin Gin Glu Arg Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser 
185 19° 195 

Trp Lys Asn 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 
5 (B) TYPE: Amino Acid 

CD) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Gly Gly Gly Glu Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 " 5 10 15 

10 Tyr Lys Val Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Thr 

20 25 30 

Ala Lys Arg Arg val Val Glu Arg Glu Lys Arg Ala Val Gly He 
35 40 ' .45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
, 5 50 SS 60 

Glv Ala Ala Ser Met Thr Val Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly lie Val Gin Gin Gin ser Asn Leu Leu Arg Ala He Glu 
80 85 90 

20 Ala Gin Gin Gly Leu Leu Gin Leu Thr He Trp Gly He Lys Gin 

95 100 l° s 

Leu Gin Ala Arg Val Leu Ala He -Glu Arg Tyr Leu Lys Glu Gin 
H0 3-15 120 

. Gin - Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
25 ~ ' 125 130 135 

Thr Thr val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Gin Thr 
140 145 .- 150 

Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Asp Arg Glu He 
155 160 165 

30 Ser Asn Tyr Thr Glu Thr He Tyr Arg Leu Leu Glu Asp Ser Gin 

170 175 180 

Asn Gin Gin Glu Arg Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser 
185 190 ■ 195 

Trp Lys Asn 
35 198 

(2) INFORMATION FOR SEQ ID NO : 81 : 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 198 amino acids 

(B) TYPE: Amino Acid 
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(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
15 10 15 

5 Tyr Lys Val Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Thr 

20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Leu 
35 - 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
10 - 50 55 60 

Gly Ala Ala Ser lie Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 .75 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
80 85 90 

15 Ala Gin Gin Gly Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Thr Arg Val Leu Ala He Glu Arg Tyr Leu Lys Asp Gin 
110 . 115 120 

Gin Leu Leu Gly Met Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
20 * 125 130 135 

Thr Ala Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Gin Thr 
140 145 150 

Asp He Trp Asp Asn Met Thr Trp Met Gin Trp Asp Arg Glu He 
155 160 165 

25 Ser Asn Tyr Thr Asn Thr He Tyr Arg Leu Leu Glu Asp Ser Gin 

170 175 180 



Asn Gin Gin Glu Arg Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser 
185 190 195 

Trp Lys Asn 
30 198 

(2) INFORMATION FOR SEQ ID NO:B2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Asn Glu Leu Tyr Lys 
1 1 5 10 . 15 
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Tvr Lys Val Val Glu He Lys Pro Leu Gly He Ala Pro Thr Gly 

25 3 0 



20 



Ser Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 
35 m 40 45 

5 Gly Ala Val Leu Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 .55 6 0. 

Ala Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
10 80 35 SO 

Ala Gin Gin Gly Met Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Thr Arg Val Leu Ala He Glu Arg Tyr Leu Lys Asp Gin 
110 X15 120 

15 Gin 
121 

(2) INFORMATION FOR SEQ ID NO: 83: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 72 amino acids 
20 <B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 



Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys 
1 ' 5 10 15 

25 Tyr Lys Val Val Glu He Lys Pro Leu Gly Val Ala Pro Thr Thr 

20 25 ^ 

Pro Lys "Arg Arg Val Val Glu- Arg Glu Lys Arg Ala Val Gly He 

40 45 



35 

Glv Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
30 50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Pro Leu Arg 
' 65 70 72 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 269 amino acids 

(E) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
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• 1 5 10 15 - 

Tyr Lys Val Val Arg He Glu Pro Leu Gly Xaa Ala Pro Thr Xaa 
20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
5 ' 35 40 45 

Gly Ala Xaa Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

10 Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 

80 35 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
15 110 115 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys His He Cys Thr 
125 130 135 

Thr Xaa Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Leu Asp 
140 145 150 

20 Glu He Trp Gin Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 

155 160 1S5 

Asp Asn Tyr Thr Gly Leu He Tyr Ser Leu He Glu Glu Ser Gin 
170 175 180 

He Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys 
25 185 130 I 95 

Trp Ala Ser Leu Trp Asn Trp Phe Ser He Thr Lys Trp Leu Trp 
200 205 210 

Tyr He Lys He Phe He Met He Val Gly Gly Leu He Gly Leu 
215 220 225 

30 Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin 

230 235 240 

Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala Pro Arg 
245 250 255 

Gly Xaa Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly 
35 260 26S 269 

(2) INFORMATION FOR SEQ ID NO: 85: 

<i) SEQUENCE CHARACTERISTICS: .... 

(A) LENGTH: 198 amino acids 

(B) TYPE:. Amino Acid 
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"(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 85: 

Gly Gly Gly Asp Met Lys Asp Asn Trp Arg Asn Glu Leu Tyr Lys 
1 J 5 10 15 

5 Tyr Lys Val Val Arg He Glu Pro Leu Gly He Ala Pro Thr Arg 

20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
35 . 40 45 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
10 50 55 60 

Gly Ala Val Ser Val Ala Leu Thr Gly Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

15 Ala Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Ser Tyr Leu Lys Asp Gin 
110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys His He Cys Thr 
20 125 130 135 

- Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Leu Glu 

140 145 150 



25 



Glu lie Trp Asn Asn Met Thr Trp He Glu Trp Glu Arg Glu He 
15 5 160 16 5 



Asp Asn Tyr Thr Gly Val He Tyr Ser Leu He Glu Asn Ser Gin 
170 175 180 

He Gin Gin Glu Lys Asn Glu Gin Asp Leu Leu Gin Leu Asp Lys 
185 190 195 

•Trp Ala Ser 
30 198 

(2) INFORMATION FOR SEQ ID NO:86: - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
.15 10 iS 



-217- 



BNSDOCID: <WO 9820036A1„I_> 



WO 98/20036 PCT/US97/20069 

Tvr Lys Val Val Lys He Glu Pro lie Gly Val Ala Pro Thr Lys 
20 25 30 

Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala lie Gly Leu 
35 . 40 45 

5 Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 60 

Glv Ala Ala Ser Val Thr Leu Thr Val Gin Ala Arg Gin Leu Met 
65 70 75 

Ser Gly He Val His Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
10 J 90 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu Gin Ala Arg Val " Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin 
110 115 12 0 

15 Gin Leu Leu Gly He Trp Gly Cys Ser Gly Arg His He Cys Thr 

125 130 135 

Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Leu Asp 
140 1^5 150 

Glu He Trp Gin Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
20 " 155 160 165 

Asp Asn Tyr Thr Gly Leu He Tyr Ser Leu He Glu Glu Ser Gin 
170 175 180 

He Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys 
185 1^0 195 

25 Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO: 87: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 
30 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Gly Gly Gly Asp Met Arg Asp Asn Trp He Ser Glu Leu Tyr Lys 
1 * ^ 5 10 15 

35 Tyr Lys Val Val Arg He Glu Pro Leu Gly Val Ala Pro Thr Lys 

20 25 30 . 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
35 - 40 45 

Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
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50 S5 60 

Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 



80 



85 9° 



10 



Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Levi Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gin Asp Gin 
110 H5 120 

Arg Leu Leu Gly Met Trp Gly Cys Ser Gly Lys His He Cys Thr 
12S 130 135 

Thr Phe Val Pro Trp Asn ser Ser Trp Ser Asn Arg Ser Leu Asp 
i 140 145 150 

j 

Asp He Trp Asn Asn Met Thr Trp Met Gin Trp Glu Lys Glu He 
15 ~ 155 1^0 165 

Ser Asn Tyr Thr Gly He He Tyr Asn Leu He Glu Glu Ser Gin 
j 17I0 175 180 

lie Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys 
! lis 190 195 

20 Trp Ala Ser 
198 

(2); INFORMATION FOR SBQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 
■■"! (A> LENGTH : *198 amino acids 
25 ! (B) TYPE: Amino Acid 

(D) TOPOLOG^: Linear 

• i i 

Tpci ) SEQUENCE DESCRIPTION i SEQ ID NO : 8 8 : 
■ GLy Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

30 Tyr Lys Val Val Gin He Glu Pro Leu Gly Val Ala Pro Thr Arg 

20 25 30 

„. .. 5 • j 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
35 40 4 5 

Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
35 50 55 60 

i ! 

Gly Ala Arg Ser vkl Thr Leu Thr Val Gin. Ala Arg Gin Leu Met 
65 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 
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Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly Xle Lys Gin 
95 100 105 

Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 115 120 

5 Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys His He Cys Thr 

X25 13 0 135 

Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Leu Asn 
140 145 150 

Glu He Trp Gin Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
10 155 160 165 

Asp Asn Tyr Thr Gly Leu He Tyr Ser Leu He Glu Glu Ser Gin 
170 175 180 

Thr Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys 
185 190 195 

15 Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO; 09; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 198 amino acids 
20 <B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

25 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Arg 

20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
35 40 45 

Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
30 ' 50 55 60 

Gly Ala Arg Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

.65 . 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

35 Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr 
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125 



130 135 



Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Arg Ser Leu Asn 
140 145 ISO 

Asp He Trp Gin Asn Met Thr Trp Met Glu Trp Glu Arg Glu lie 
5 ~ 155 160 165 

Asp Asn Tyr Thr Gly Leu lie Tyr Arg Leu lie Glu Glu Ser Gin 
170 175 180 

Thr Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys 
185 190 . 1^5 

10 Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IBS amino acids 
15 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

. Gly Gly Gly Asp Met Arg Asp Asn Trp Arg. Ser Glu Leu Tyr Lys 
1 5 - . 10 15 

20 Tyr Lys Val lie Lys lie Glu Pro Leu Gly Leu Ala Pro Thr Arg 

20 25 30 

Ala Lys Arg Arg Val Val Ala Arg Glu Lys Arg Ala He Gly Leu 
35 40 45 

" Gly Ala Val_Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

25 ' " ' 50 55 60 

Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Met 

,„-.,..„ 65 - .. 70 75 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

30 Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

Leu Gin Ala Arg Val Leu Ala Val Glu Ser Tyr Leu Lys Asp Gin 
110 H5 120 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Arg His lie Cys Pro 
35 125 130 135 

Thr Gin Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Leu Asp 
140 145 150 

Thr He Trp Gly Asn Met Thr Trp Met Glu Trp Glu Arg Glu He 
155 160 165 
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Ser Asn Tyr Thr Gly Leu He Tyr Asp Leu He Glu Glu Ser Gin 
170 175 180 

He Gin Gin Glu Lys Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys 
X85 190 195 

5 Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO: 91: 

( i ) SEQUENCE CHARACTERI S TI CS : 

(A) LENGTH: 104 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Asn Glu Leu Tyr Lys 
! 5 10 15 

15 Tyr Lys Val Val Arg He Glu Pro Leu Gly Leu Ala Pro Thr Lys 

20 25 30 

Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
20 * 50 55 60 

Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
55 70 75 

Ser Glv He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu 
80 85 90 

25 Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

95 100 104 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 70 amino acids 
30 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

35 Tyr Lys Val Val Arg He Glu Pro Leu Gly He Ala Pro Thr Met 

20 25 30 

Ser Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
35 40 45 

Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
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50 



55 60 



Gly Ala Ala Thr Leu Thr Leu Thr Val Xaa 
65 70 

{2) INFORMATION FOR SEQ ID NO: 93: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 93: 

10 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 1° 15 

Tyr Lys Val Val Arg lie Glu Pro Leu Gly Leu Ala Pro Thr Glu 
20 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala He Gly Leu 
15 ' 35 40 45 

Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 : 60 

Gly Ala Ala Ser Met Thr Leu Thr Val Xaa 
65 70 

20 (2) INFORMATION FOR SEQ ID NO : 94 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE' DESCRIPTION: SEQ ID NO : 94 : 

Gly Gly Gly Asp Met Arg Asp Asn Arg Arg Ser Glu Leu Tyr Lys 
1 ' * J " 5 10 15- 

Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
20 25 30 

30 Thr Lys Arg Arg Val Val Glu Arg Glu Glu Arg Ala He Gly Leu 

35 40 45 

Gly Ala Met Phe Leu Gly Phe Leu 
50 53 

(2) INFORMATION FOR SEQ ID NO: 95: 

35 (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION.: SEQ ID NO: 95: 
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Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 .15 

Tyr Lys Val Val Arg lie Glu Pro Leu Gly He Ala Pro Thr Met 
20 25 30 

5 Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg 

35 40 41 

(2)' INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 
10 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Gly Gly Asn He Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 l^ 

15 Tyr Lys Val Val Gin He Glu Pro Leu Gly He Ala Pro Thr Arg 

20 25 . 30 

. Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Xaa 
35 40 .45 

He Gly Ala Met He Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr 
20 50 55 SO 

Met Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu 

. 65 70 75 

Leu Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He 
80 85 90 

25 Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 

95 100 105 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp 
110 115 120 

Gin Lys Phe Leu Gly Leu Trp Gly Cys Ser Gly Lys He He Cys 
30 125 130 135 

Thr Thr Ala Val Pro Trp Asn Ser Thr Trp Ser Asn Arg Ser Phe 
. X40 145 150 

Glu Glu He Trp Asn Asn Met Thr Trp lie Glu Trp Glu Arg Glu 
155 160 165 

35 He Ser Asn Tyr Thr Asn Gin He Tyr Glu He Leu Thr Glu Ser 

170 175 180 

Gin Asn Gin Gin Asp Arg Asn Glu Lys Asp Leu Leu Glu Leu Asp 
185 190 195 

Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp lie Thr Ash .Trp Leu 
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200 



205 210 



Trp Tyr lie Lys He Phe He Met He Val Gly Gly Leu He Gly 
215 220 225 

Leu Arg He He Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
5 " 230 235 240 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Pro xaa His His Gin 
245 250 255 

Arg Glu Pro Asp Arg Pro Glu Arg He Glu Glu Gly Gly Gly 
260 265 269 

10 (2> INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Gly Gly Gly Asn He Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
1 5 10 15 

Tvr Lys Val Val Gin He Glu Pro Leu Gly He Ala Pro Thr Arg 
20 25 30 

20 Ala Lys Arg Arg val Val Glu Arg Glu Lys Arg Ala val Gly lie 

35 40 45 

Gly Ala Met He Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Val 
25 65 ' 70 75 

Ser Gly Tie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
80 8S 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 1° 5 

30 Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

110 US 120 

Lys Phe Leu Gly Leu Trp Gly Cys Ser Gly Lys lie lie Cys Thr 
12S 130 135 

Thr Ala Val Pro Trp Asn Ser Thr Trp Ser Asn Arg Ser Phe Glu 
35 140 145 150 

Glu lie Trp Ser Asn Met Thr Trp He Glu Trp Glu Arg Glu He 
155 160 . I 65 

Ser Asn Tyr Thr Asn Gin He Tyr Glu He Leu Thr Glu Ser Gin 
170 175 180 
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Asn Gin Gin Asp Arg Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys 

185 190 195 

Trp Ala Ser 
198 

5 (2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

10 {xi) SEQUENCE DESCRIPTION: SEQ ID NO : 98 : 

Gly Gly Gly Asn lie Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

1 5 10 15 

Tyr Lys Val Val Gin lie Glu Pro Leu Gly He Ala Pro Thr Arg 

20 25 30 

15 Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 

35 40 45 

Gly Ala Met He Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met. 

50 55 60 

Gly Ala Ala Ser lie Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

20 65 70 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 

80 85 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

25 Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

110 H5 120 

Lys Phe Leu Gly Leu Trp Gly Cys Ser Gly Lys He He Cys Thr 

125 130 135 

Thr Ala Val Pro Trp Asn Ser Thr Trp Ser Asn Arg Ser Phe Glu 

30 140 145 150 

Glu He Trp Asn Asn Met Thr Trp Thr Glu Trp Glu Arg Glu He 

155 160 165 

Ser Asn Tyr Thr Asn Gin He Tyr Asp He Leu Thr Glu Ser Gin 

170 175 ISO 

35 Asn Gin Gin Asp Arg Asn Glu Lys Asp Leu Leu Gly Leu Asp Lys 

1B5 19° 195 

Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO: 99: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 : 

Gly Gly Gly Asn lie Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

1 5 10 15 

Tyr Lys Val Val Gin' He Glu Pro Leu Gly He Ala Pro Thr Arg 

20 25 30 

10 Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly lie 

35 40 45 

Gly Ala Met He Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

50 55 60 

Gly Ala Ala Ser He Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

15 65 70 . 75 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu 

80 85 90 

Ala Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin 

95 100 105 

20 Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

110 H5 120 

Lys Phe Leu Gly Leu Trp Gly Cys Ser Gly Lys He He Cys Thr 

125 130 13 5 

Thr Ala Val Pro Trp Asn Ser Thr Trp Ser Asn Lys Ser Phe Glu 

„25 . 140 . 145 - 150 

Glu He Trp Asn Asn Met Thr Trp Thr Glu Trp Glu Arg Glu He 

155 160 165 

Ser Asn Tyr Thr Asn Gin lie Tyr Glu He Leu Thr Glu Ser Gin 

170 175 180 

30 Asn Gin Gin Asp Arg Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys 

185 ISO 195 

Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO: 100: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 amino acids 

(B) TYPE: Amino Acid ■ 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO:100: 
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Gly Gly Gly Asn lie Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

X 5 10 15 

Tyr Lys Val Val Gin lie Glu Pro Leu Gly lie Ala Pro Thr Arg 

2 0 25 3 0 

5 Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly lie 

35 40 45 

Gly Ala Met lie Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 

. 50 55 60 

Gly Ala Ala Ser lie Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 

10 65 70 75 

Ser Gly lie val Gin Gin Ser Asn Leu Leu Arg Ala Glu Ala Gin 

80 85 90 

Gin His Leu Leu Gin Leu Thr Val Trp Gly Gin Leu Gin Ala Arg. 

95 100 105 

15 Val Ala Val Glu Arg Tyr Leu Lys Asp Gin Lys Leu Gly Leu Trp 

110 115 120 

Cys Ser Gly Lys He He Cys Thr Thr Ala Val Pro Trp Asn Ser 

125 130 135 

Thr Trp Ser Asn Arg Ser Phe Glu Glu He Trp Asn Asn Met Trp 

20 140 145 150 

He Glu Trp Arg Glu He Ser Asn Tyr Thr Asn Gin He Tyr Glu 

155 160 165 

He Leu Thr Glu Ser Gin Asn Gin Gin Asp Arg Asn Glu Lys Asp 

170 175 180 

25 Leu Leu Glu Leu Asp Lys Trp Ala Ser 

185 189 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 
30 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID 110:101: 

Gly Gly Gly Asn He Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

1 5 10 15 

35 Tyr Lys Val Val Gin He Glu Pro Leu Gly He Ala Pro Thr Arg 

20 25 . . 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 

35 40 45 

Gly Ala Met He Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
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50 55 60 

Gly Ala Ala Ser Ile Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
65 70 75 

Ser Gly He Val Gin- Gin Gin Ser Asn Leu Leu Arg Ala He Glu 
5 ^ 80 S5 90 

Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
95 100 105 

Leu ; Gln Ala Arg Val. Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
110" ' 115 120 

10 Lys Phe Leu Gly Leu Trp Gly Cys Ser Gly Lys He lie Cys Thr 

125 130 135 

Thr Ala Val Pro Trp. Asn Ser Thr Trp Ser Asn Arg Ser Leu Glu 
• j 1401 ' 145 150 

Glu He Trp Asn Asn Met Thr Trp He Glu Trp Glu Arg Glu He 
15 ~ 155 160 165 

Ser ^ Asn Tyr Thr Asn Arg He Tyr Glu He Leu Thr Lys Ser Gin 
170: 175 ISO 

Asp "Gin Gin Asp Arg Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys 
] 18S 190 195 

20 Trp Ala Ser 
198 

(2) INFORMATION FOR SEQ ID NO: 102; 

it) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: ^0 amino acids 

25 1 (B) TYPE: Amino Acid 

j {D> TOPOLOGY.; Linear 

i | •• 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 
1 i • 

Gly^ Gly Gly Asn lie Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
i; S 10 15 

30 Tyr. Lys Val Val Gin lie Glu Pro Leu Gly lie Ala Pro Thr Arg 

20'* 25 30 

Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly He 
j 35 • 40 45 

Gly Ala Met lie Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
35 ~ 50 55 60 

4 .. 
Gly Ala Pro Ser lie Thr Leu Thr Val Xaa 

65 70 . 

(2) INFORMATION FOR SEQ ID NO:103: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 
(B> TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Xaa Gly Gly Asp Met Lys Asp He Trp Arg Thr Glu Leu Tyr Asn 
1 " 5 10 15 

Tvr Lys Val Val Arg lie Lys Pro Xaa Ser Val Ala Pro Thr Lys 
20 25 30 

10 Xaa Xaa Arg Pro Xaa He Xaa Xaa Xaa Xaa Xaa His Arg Xaa Lys 

35 40 45 

Arg Ala Val Gly Xaa Leu Gly Met Leu Phe Leu Gly Val Leu Ser 
50 55 _ 60 

Ala Ala Gly Ser Thr Met Gly Ala Ala Ala Thr Xaa Leu Thr Val 
15 65 70 75 

Gin Thr Xaa Xaa Leu Leu Lys Gly He Val Gin Gin Gin Asp Asn 
80 85 90 

Leu Leu Arg Ala He Xaa Ala Gin Gin His Leu Leu Xaa Leu Ser 
95 100 105 

20 Val Trp Gly Xaa Xaa Gin Leu Xaa Ala Arg Leu Leu Ala Xaa Glu 

110 H5 12 0 

Thr Xaa Leu Gin Xaa Gin Gin Leu Leu Ser Leu Trp Gly Cys Lys 
125 130 135 

Gly Lys Leu Val Cys Tyr Thr Xaa Val Xaa Trp Asn Asn Ser Trp 
2 5 140 3.45 ISO 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser Ser Xaa Xaa Xaa 
155 ISO 165 

Xaa He Trp Asp Asn Leu Thr Trp Gin Xaa Trp Asp Arg Leu Xaa - 
170 175 180 

30 Ser Asn Xaa Xaa Xaa Xaa He Tyr Xaa Glu Xaa Gin Xaa Ala Gin 

185 130 195 

■ - Xaa Gin Gin Glu Lys Asn Glu Lys Xaa Leu Leu Glu Leu Asp Glu 

200 205 210 

Trp Ala Ser 
35 213 

(2) INFORMATION FOR SEQ ID NO : 104 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 0 amino acids 

(B) TYPE: Amino Acid 
40 (D) TOPOLOGY: Linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

lie Gly Gly Asp Met Lys Asp lie Trp Arg Thr Glu Leu Phe Asn 
m± ' ^ " 5 10 15 

Tyr Lys Val Val Arg Val Lys Pro Phe Ser Val Ala Pro THr Arg 
5 20 25 30 

He Ala Arg Pro Val He Ser Thr Arg Thr His Arg Glu Lys Arg 
35 40 45 

Ala Val Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser Ala Ala 
50 55 60 

10 Gly Ser Thr Met Gly Ala Ala Ala Thr Thr Leu Ala Val Gin Thr 

65 70 75 

His Thr Leu Leu Lys Gly He Val Gin Gin Gin Asp Asn Leu Leu 
80 85 90 

Arg Ala He Gin Ala Gin Gin Gin Leu Leu Arg Leu Ser Xaa Trp 
15 95 100 105 

Gly He Arg Gin Leu Arg Ala Arg Leu Leu Ala Leu Glu Thr Leu 

. no . US I 20 

Leu Gin Asn Gin Gin Leu Leu Ser Leu Trp Gly Cys Lys Gly Lys 
125 "0 135 

20 Leu Val Cys Tyr Thr Ser Val Lys Trp Asn Arg Thr Trp He Gly 

140 145 150 

Asn Glu Ser He Trp Asp Thr Leu Thr Trp Gin Glu Trp Asp Arg 
155 160 "5 

Gin He Ser Asn He Ser Ser. Thr. He Tyr Glu Glu He Gin Lys 
25 170 175 180 

Ala Gin Val Gin Gin Glu Gin Asn Glu Lys Lys Leu Leu Glu Leu 
185 190 • X9B 

Asp Glu Trp Ala Ser 
200 

30 {2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: 

Val Gly Gly Asp Met Lys Asp lie Trp Arg Thr Lys Leu Tyr Asn 
! 5 10 15 

Tyr Lys Val Val Gin He Lys Pro Phe Ser Val Ala Pro Thr Lys 
20 25 30 
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Met Ser Arg Pro He He Asn He His Thr Pro His Arg Glu Lys 
35 40 45 

Arg Ala Val Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser Ala 
50 _ 55 60 

5 Ala Gly Ser Thr Met Gly Ala Ala Ala Thr Ala Leu Thr Val Arg 

65 70 75 

Thr His Ser Val Leu Lys Gly He Val Gin Gin Gin Asp Asn Leu 
80 8E3 SO 

Leu Arg Ala lie Gin Ala Gin Gin His Leu Leu Arg Leu Ser Val 
10 ' SB 100 105 

Trp Gly He Arg Gin Leu Arg Ala Arg Leu Gin Ala Leu Glu Thr 
110 115 12 0 

Leu He Gin Asn Gin Gin Arg Leu Asn Leu Trp Gly Cys Lys Gly 
125 130 135 

15 Lys Leu He Cys Tyr Thr Ser Val Lys Trp Asn Thr Ser Trp Ser 

140 145 150 

Gly Arg Tyr Asn Asp Asp Ser He Trp Asp Asn Leu Thr Trp Gin 
155 160 1^5 

Gin Trp Asp Gin His He Asn Asn Vai Ser Ser He He Tyr Asp 
20 " 170 175 180 

Glu He Gin Ala Ala Gin Asp Gin Gin Glu Lys Asn Val Lys Ala 
185 1^0 1^5 

Leu Leu Glu Leu Asp Glu Trp Ala Ser 
200 204 

25 (2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106; 

Thr Gly Gly Asn Met Lys Asp- He Trp Arg Ser Glu Leu Tyr Lys 
1 * 5 10 15 

Tyr Lys Val Val Arg He Glu Pro Leu Ser Val Ala Pro Thr Lys 
20 25 . 3 0 

- 35 Ala Arg Arg His Thr Val Ala Arg Gin Lys Asp Arg Gin Lys Arg 

.35 40 45 

Ala Ala Phe Gly Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly. Ala 
50 55 60 

Ala Gly Ser Thr Met Gly Ala Ala Ala Val Thr Leu Thr Val Gin 
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65 



70 75 



Ala Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu 
80 85 90 

Leu Lys Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Ser He 
5 95 100 105 

Tro Gly Val Lys Gin Leu Gin Ala Arg Leu Leu Ala Val Glu Arg 
110 H5 120 

Tvr Leu Gin Asp Gin Gin He Leu Gly Leu Trp Gly Cys Ser Gly 
125 130 135 

10 Lvs Ala Val Cys Tyr Thr Thr Val Pro Trp Asn Asn Ser Trp Pro 

140 150 

Gly Ser Asn Ser Thr Asp Asp He Trp Gly Asn Leu Thr Trp Gin 
155 160 165 

Gin Trp Asp Lys Leu Val Ser Asn Tyr Thr Gly Lys He Phe Gly 
,5 170 175 180 

Leu Leu Glu Glu Ala Gin Ser Gin Gin Glu Lys Asn Glu Arg Asp 
' . 185 19° 195 

Leu Leu Glu Leu Asp Gin Trp Ala Ser 
200 204 

20 (2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Glu Trp Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His Ser 

1. 5 : 10 - 7 15 

Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

30 (2) INFORMATION FOR SEQ ID NO: 108: 

' ( i ) SEQUENCE CHARACTERISTICS : ? 

(A) LENGTH: 36 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Tvr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin 
1 5 10 15 

Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
20 25 30 
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Ser Leu Trp Asn Trp Phe 
35 36 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS : 
5 (A) LENGTH: 46 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY; Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 109: 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Xaa Leu He 
10 1 5 10 15 

Tyr Thr Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu 
20 25 30 

Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 
35 40 45 

15 Phe 
46 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 46 amino acids 
20 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Trp Leu Gin Trp Asp Lys Glu He Ser Asn Tyr Thr Xaa lie He 
! 5 10 15 

25 Tyr Asn Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu 

20 25 3 0 

Gin Asp Leu Leu Ala Leu Asp Lys Trp Ala Asn Leu Trp Asn Trp 
35 40 45 

Phe 
30 46 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 46 amino acids 

(B) TYPE: Amino Acid 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 111: 

Trp Met Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Asp Thr He 
! 5 10 15 

Tyr Arg Leu Leu Glu Asp Ser Gin Asn Gin Gin Glu Arg Asn Glu 
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20 



25 30 



Lys Asp Leu Leu Ala Leu Asp Ser Trp Lys Asn Leu Trp Asn Trp 
35 40 4 5 

Phe 
5 46 

(2) INFORMATION FOR SEQ IB NO: 112: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 46 amino acids 

(B) TYPE: Amino Acid 
10 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 112: 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Gly Leu He 

1 5 * 10 r. 15 

Tyr Ser Leu He Glu Glu Ser Gin He Gin Gin Glu Lys Asn Glu 
15 20 25 30 

Lvs Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 

35 " " 40 j 45 

Phe . _ . 
46 

20 (2) INFORMATION FOR SEQ ID NO : 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 113: 

Trp He Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Asn Gin He 
1 5 10 15 

Tvr Glu He Leu Thr Glu Ser Gin Asn Gin Gin Asp Arg Asn Glu 
20 25 30 

30 Lys Asp Leu Leu Glu Leu Asp Lys Trp Ala 

35 40 ; 
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What is claimed is: 

L A method of constructing a constrained helical peptide, comprising the steps of: 

(a) synthesizing a peptide, wherein the peptide comprises a sequence of eight amino acid 
residues, wherein the sequence of eight amino acid residues has a first terminal residue and a 

5 second terminal residue, wherein the first terminal residue and the second terminal residue flank 

an internal sequence of six amino acid residues, and wherein the first and second terminal 
residues have a side chain containing an amide bond-forming substituent; 

(b) providing a difunctional linker having a first functional group capable of forming an amide 
linkage with the side chain amide bond-forming substituent of the first terminal residue and 

1 0 having a second functional group capable of forming an amide linkage with the side chain amide 

bond-forming substituent of the second terminal residue; and 

(c) cyclizing the peptide by reacting the side chain amide bond-forming substituent of the first 
terminal residue with the first functional group of the difunctional linker to form an amide linkage 
and reacting the side chain amide bond-forming substituent of the second terminal residue with 

1 5 the second functional group of the difunctional linker to form an amide linkage, yielding a 

constrained heiical peptide. 

2. The method of claim 1 wherein in step (a) the side chain amide bond-forming substituent of the 
first terminal residue is protected with a first protecting group and the side chain amide bond- 
forming substituent of the second terminal residue is protected with a second protecting group, 

20 wherein the first protecting group and the second protecting group are differentially removable, 

and wherein in step (c) the first protecting group is removed such that the side chain amide bond- 
forming substituent of the first terminal residue is deprotected and the side chain amide bond- 
forming substituent of the second terminal residue is not deprotected before the peptide is reacted 
with the difunctional linker, and thereafter the peptide is reacted with the difunctional linker to 

25 form an amide linkage between the side chain amide bond-forming substituent of the first 

terminal residue and the first functional group of the difunctional linker, and thereafter the second 
protecting group is removed from the side chain amide bond-forming substituent of the second 
terminal residue and the peptide is cyclized by intramolecuiarly reacting the side chain amide 
bond-forming substituent of the second terminal residue with the second functional group of the 

30 difunctional linker to form an amide linkage. 

3 . A method of constructing a constrained helical peptide, comprising the steps of: 
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(a) synthesizing a peptide, wherein the peptide comprises a sequence of eight amino acid - 
residues, wherein the sequence of eight amino acid residues has a first terminal residue and a 
second terminal residue, wherein the first terminal residue and the second terminal residue flank 
an internal sequence of six amino acid residues, wherein the first and second terminal residues 

5 have a side chain containing an amide bond-forming substituent, wherein the first terminal 

residue is coupled to a difunctional linker having a first functional group and a second functional 
group, wherein the first functional group is in an amide linkage with the side chain amide bond- 
forming substituent of the first terminal residue, and wherein the second functional group of the 
difunctional linker is capable of forming an amide linkage with the side chain amide bond- 

1 0 forming substituent of the second terminal residue; and 

(b) cyclizing the peptide by intramolecularly reacting the side chain amide bond-forming 
substituent of the second terminal residue with the second functional group of the difunctional 
linker to form an amide linkage and yield a constrained helical peptide. 

15 

4. A compound selected from the group consisting of: 

the compound represented by Formula (1): 

0=C-(NH>(CH 2 ) n -(NH)-C=0 

i I 
20 (CH 2 ) m (CH 2 ) p 

I I 
X-(NH)-(CH)-C Z— -(NH)-(CH)-C-Y-S , 



25 



o 

■ '- • (1) 

wherein S is absent or is a macromolecule, X is hydrogen or is any ammo acid or amino acid 



sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, 
Z is any amino acid sequence consisting of six amino acids; m and p are independently selected 
30 from the integers 0 to 6 inclusive, provided that m+p is less than or equal to 6, and n is any 

integer in the range defined by (7-(m+p)) to (9-(m+p)) inclusive, provided that n is greater than 1 ; 

the compound represented by Formula (6): 

O 

II 

H-N-C-(CH 2 ) r -(NH)-C=0 



35 



(CH 2 ) q (CH 2 ) S 

40 X-(NH)-(CH)-C--Z-(NH)-(CH)-C-Y-S 

II , H 

o o 

(6) 
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wherein S is absent or is a macromoiecute. X is hydrogen or is any amino acid or amino acid 
sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, 
Z is any amino acid sequence consisting of six amino acids, q is selected from the integers ! to 7 
inclusive, s is selected from the integers 0 to 6 inclusive, provided that q+s is less than or equal to 
5 7, and r is any integer in the range defined by (7-(q+s)) to (9-(q+s)) inclusive, provided that r is 

greater than 0; 

the compound represented by Fonnuia (11): 

o 

II 

10 OC-(NH)-(CH 2 ) u -C-N-H 

I i 
(CH 2 ) t (CH 2 ) V 

I I 
X-(NH)-(CH)-C-Z-0<fH)-(CH)-C-y-S , 



15 



o 



(11) 

wherein S is absent or is a macromolecuie, X is hydrogen or is any amino acid or amino acid 
sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, 
20 Z is any amino acid sequence consisting of six amino acids; t is selected from the integers 0 to 6 

inclusive, and v is selected from the integers 1 to 7 inclusive, provided that t+v is less than or 
equal to 7; and u is any integer in the range defined by (7-(t+v)) to (9-(t+v)) inclusive, provided 
that u is greater than 0; and 

the compound represented by Formula (16): 

25 

o o 

II 11 

H-N-C-(CH 2 ) X -C-N-H 

j I 
30 (CH 2 ) W (CH 2 ) y 

II 
X-(NH)-C-C~ Z— -N-(CH)-C-Y-S , 

I II I" II 

HO H O 

35 (16) 

wherein S is absent or is a macromolecuie, X is hydrogen or is any amino acid or amino.acid 
sequence, Y is absent, or is hydroxyl if S is absent, or is any amino acid or amino acid sequence, 
Z is any amino acid sequence consisting of six amino acids; w and y are independently selected 
from the integers 1 to 7 inclusive, provided that w+y is less than or equal to 8, and x is any 
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r 

integer in the range defined by (7-(w+y)) to (9H>+y)) inclusive, provided that x is greater-than or 
equal to 0. 

5. • The compound of claim 4 that is the" compound of Formula (1), wherein Z is Gln-Gln~Arg-Arg- 
Phe-Tyr. 

5 6. A constrained helical peptide made according to the method of claim 1 . 

7. A constrained helical peptide made according to the method of claim 3. 

8. A compound according to claim 4, wherein Z is an amino acid sequence consisting of six amino 
; acids, wherein the internal sequence of six amino acids has the form gabcde, defgab, or cdefga 
j and is selected from the group of sequences consisting of a sequence of six contiguous amino 

10 acids in HIV-tLAI strain gp41 amino acid sequence 633 to 678, in its homolog sequence from 

another HIV strain, in a consensus sequence of its homolog sequences from any one HIV clade, 
or amino acid substituted variant thereof, in which amino acid 633 or its corresponding amino 
■j acid in the homolog, consensus or variant sequence is assigned position a of a repeating abcdefg 
assignment. 

1 5 9. The compound of claim 8, further comprising S* when S is absent and X is any amino 
acid or amino acid sequence, wherein S' is a macromolecule attached to X. 

10,^ A compound comprising a first constrained helical peptide comprising a peptide 

comprising a sequence of eight amino acid residues, wherein the sequence of eight 
j amino acid residues has a first terminal residue and a second terminal residue, wherein 
20 I the first terminal residue and the second terminal residue flank an internal sequence of 

* six amino acids, wherein the first and second terminal residues have a side chain that are 
■] linked to each other forming a locking moiety to form a constrained helical peptide, 
1 wherein the internal sequence of six amino acids has the form gabcde, defgab, or cdefga 
and is selected from the group of sequences consisting of a sequence of six contiguous 
25 i amino acids in HIV- 1 LAI strain gp4 1 amino acid sequence 63 3 to 678, in its homolog 

! sequence from another HIV strain, in a consensus sequence of its homolog sequences 
from any one HIV clade, or in an amino acid substituted variant thereof, in which amino 
acid 633 or its corresponding amino acid in the homolog, consensus or variant sequence 
is assigned position a of a repeating abcdefg assignment 

i 

30 II, The compound of claims 8 or 1 0, wherein the homolog or consensus sequence is shown 
in Figures I6A-16G. 
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12. A compound of claims 8 or J 0, further comprising a second constrained helical peptide. - 

13. An antibody that binds to a compound of claim 8, wherein the antibody specifically binds an 
epitope comprising an amino acid at position" a, d, e> or g in the helical peptide, 

14. A method to prophylacticaliy or therapeutically treat a mammal at risk for or infected with HIV, 
5 comprising administering a prophylacticaliy or therapeutically effective amount of a compound of claims 

8 or 10. 

15. The method of claim 14, wherein the composition comprises internal six amino acid sequences 
from different HIV strains or HIV c Jades. 

16. A vaccine comprising at least one compound of claims 8 or 10. 
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